EDITORIAL


An opportunity to improve innovation 


The rapid development of COVID-19 vaccines is
the outcome of Big Pharma’s R&D prowess, bil- 
lions of dollars in federal investment, and the 
dedication and ingenuity of scores of scientists. 
But it also illustrates the logic of the Bayh-Dole 
Act, passed 40 years ago in the United States, 
which governs how universities can work with 
industry to reap the benefits of federally funded re- 
search. The act permits universities to collaborate 
with private companies to license and commercialize 
these technologies under the rationale that the payoff 
for the government’s investment is increased econom- 
ic activity for the country. It is unlikely that the act 
will ever be brought back to the floor of Congress for 
revision, and its staying power suggests that it is a per- 
manent fixture of the U.S. inno- 
vation ecosystem. Nevertheless, 
there are legitimate philosophi- 
cal and logistical objections that 
the incoming administration 
should work to address. 

Over the years, the federal 
government has funded most of 
the basic research that under- 
lies the COVID-19 vaccines. But 
the government itself lacks the 
capacity to carry out massive 
clinical trials or to manufacture 
and distribute the vaccines on 
its own. The Moderna vaccine, 
for example, relies on patents 
that are licensed under Bayh-Dole to the biotechnol- 
ogy company. The fact that vaccines could be available 
to health care workers as soon as this week is a testa- 
ment to the effectiveness of the arrangement. 

When Bayh-Dole first came along, there were no- 
table concerns among university faculties about the 
conflicts that would arise. Would scientists be objec- 
tive about their published research if they also stood 
to gain financially? Would students and postdocs see 
their careers stalled out because results were held 
back while patents were filed or—even worse—results 
were kept secret to protect financial interests? 

Universities set up ways to monitor and correct such 
conflicts, and though there have been problems, the 
system has held up well and contributed to important 
innovation. Still, the maintenance of technology trans- 
fer offices and conflict monitoring have introduced 
costs to conducting research that are not fully com- 
pensated for by the federal government—costs that 
have taken resources away from other important university
priorities.

When a faculty member holds equity in a startup 
company, their interests are not completely aligned 
with those of the university, which can make negoti- 
ating licenses cumbersome and strained. When I was 
a university administrator, we addressed this problem 
by creating a boilerplate license with standardized 
terms that could be automatically agreed to when a 
company was formed. At the University of North Car- 
olina, we called this the “Carolina Express License” 
and at Washington University, we called it the “Quick- 
Start License.” A few other universities have such poli- 
cies—notably, Carnegie Mellon and the University of 
California, San Diego—but unfortunately, this idea 
has not been widely adopted. 
The rationale against the boiler- 
plate license is that every deal 
is different and needs to be 
separately negotiated, but that 
leaves the problem of those ad- 
ditional administrative costs, 
as well as the risk of damag- 
ing the university’s relationship 
with the faculty entrepreneur. 
A number of faculty departures 
over the years have resulted 
from sour relations caused by 
these negotiations. 

Although Bayh-Dole has pro- 
duced much economic success 
and progress on important fronts, there are major 
drawbacks to depending on the marketplace to spur 
the kind of research that benefits society—a stated ra- 
tionale for passing the act. This disconnect provides 
the strongest argument to create a more public system 
that doesn’t rely on the financial short-sightedness 
of industry collaboration; however, most attempts at 
public solutions to this problem have not led to in- 
novations applied outside the public-private model. 
Thus, the current system does not address what to do 
when there is insufficient financial interest to attract 
solutions to problems like antibiotic resistance or un- 
realized pandemics. As the new Biden administration 
forms in the United States, a productive effort might 
be seeking a means of working within the framework 
of Bayh-Dole to address compelling needs that are not 
market-driven. 


-H. Holden Thorp* 


*H.H.T. declares a number of competing interests that remain updated on his biography on the Science website. He is a venture partner
at Hatteras Venture Partners, on the board of directors of Artizan Biosciences, and a consultant to Ancora.

A Japan Aerospace Exploration Agency worker retrieves asteroid samples brought back to Earth.


PLANETARY SCIENCE 


Hayabusa2 brings back asteroid bits


After lighting up the night sky during its fiery descent through
the atmosphere, the return capsule of Japan’s Hayabusa2 mis- 
sion parachuted to a soft landing on 6 December in Woomera, 
Australia, bearing soil and rock samples from the asteroid Ryugu. 
Hayabusa2’s remote observations of Ryugu indicated it contained 
organic molecules and water, supporting the theory that similar 
asteroids may have delivered the building blocks of life to Earth billions 
of years ago. The capsule will be opened in clean rooms this week in 
Tokyo to see whether it contains 100 milligrams of material; samples 
will be shared internationally for further analysis. In 2010, the original 
Hayabusa mission was the first to return material from an asteroid to 
Earth. Launched in 2014, its successor, Hayabusa2, has already traveled 
nearly 5.3 billion kilometers. After dropping off the return capsule, the 
craft set off for a 2031 rendezvous with the asteroid 1998 KY26. 
8848.86 meters


The height of Mount Everest, which straddles China and Nepal and is the world’s tallest 
mountain, as determined by geological survey teams from both countries. It’s the first time 
the nations have agreed on the measurement, which is slightly higher than previous ones. 


Journal will require preprints 


PUBLISHING | The open-access biology 
journal eLife will only review manuscripts 
that have been posted on preprint servers— 
and will publish all peer reviews even for 
rejected papers, it announced last week. 
Most of its submissions are already posted 
as preprints, so this is the next logical 
step, says longtime open-access publish- 
ing proponent Michael Eisen, a biologist 
at the University of California, Berkeley, 
and the journal’s editor-in-chief. Until July 
2021, authors will be able to opt out of the 
preprint requirement but will be asked to 
explain why. Authors of a rejected manu- 
script may postpone publication of the 
reviews only until the paper is accepted by 
another journal. “We're pulling back the 
curtain on a process we all know is hap- 
pening,” Eisen says. 


Origins of record dust storm 


CLIMATE SCIENCE | A record-setting dust 
storm that in June swept nearly 15 million 
tons of dust from Africa’s Sahara Desert to 
the Americas might be tied to a warming 
Arctic, according to a paper in Geophysical 
Research Letters last week. Formed after 
an unusual high-pressure weather system 
parked over Africa’s northwestern edge, 
the storm’s winds kicked up the dust. The 
pressure system arose from low-latitude 
meanderings in the winds of the polar jet 
stream that normally circle the Arctic. The 
cause of such waverings is debated, but 
some scientists say Arctic warming is to 
blame for the far-reaching effects. 


Congress: Disclose all funding 


U.S. SCIENCE POLICY | Scientists applying 
for federal grants will need to list all exist- 
ing funding sources, according to a bill 
nearing final approval in the U.S. Congress. 
The new rules would unify the current 
hodge-podge of requirements across a 
dozen research agencies. The legislation 
requires all key personnel on a grant appli- 
cation to disclose both public and private 
sources of support, foreign and domestic. 
“We don’t want to stifle research or dis- 
courage partnerships, but we want to know 
about them,” says an aide to the House of
Representatives Armed Services Committee,
which drafted the language as part of a
massive bill providing policy guidance to
the Department of Defense. Legislators
hope the increased transparency will curb
China’s moves to obtain U.S. technologies.


New verdict on diplomats’ illness


SECURITY | A top U.S. science body has
concluded that pulsed radio waves may
have sickened diplomats in Cuba, lending
credence to the controversial notion that
an unknown adversary targeted them with
a microwave weapon. Starting in 2016, U.S.
diplomats in Havana and later in China
reported sudden, loud sounds and pressure
in the head, followed by symptoms such as
dizziness, nausea, and tinnitus. Proposed
causes have included the Zika virus, insecticides,
or the effects of stress. But a report
released last week by a panel convened
by the National Academies of Sciences,
Engineering, and Medicine describes the
symptoms as “unlike any disorder.” It
asserts that directed, pulsed radio energy is
“the most plausible mechanism” for acute
symptoms in some of the diplomats. The
Department of State “needs a strike team
that can move in quickly when a sudden or
unusual health event occurs,” adds panel
member Linda Birnbaum, former director
of the National Institute of Environmental
Health Sciences.


The dead do tell tales


Archaeologists recently unearthed more than 100 elaborately decorated wooden coffins
at the Saqqara necropolis, a burial complex located 20 kilometers south of Cairo, stacked
two and three deep at the bottom of deep underground shafts. The finds date back roughly
2500 years to Egypt's late period, an era separated from the reigns of more familiar pharaohs
like Tutankhamun and Ramesses II by more than 700 years of turmoil, civil war, and
decline. Science spoke to University of Tübingen archaeologist Ramadan Hussein, who was
not involved with the latest dig, about what the necropolis reveals about the ancient Egyptians.


Q: Why were there so many coffins at this site?
A: By 650 B.C., Egypt was starting to get
back on its feet and become a power in the
Mediterranean again. Saqqara once more
becomes the main cemetery for a thriving,
wealthy city full of temples. Meanwhile,
there's an intellectual movement to look back
at Egyptian history and revive its traditions.
You can see a nostalgia for what was good
in Egyptian history in the cemeteries at
Saqqara, like inscriptions on the coffins
replicating religious texts from the walls of
nearby pyramids.


Q: What do the burials tell you about Egypt at the time?
A: In the late period, Egypt has started
becoming an international power again,
and as a result it's becoming a real mosaic
of ethnicities: There are Phoenicians and
Greeks and Libyans, and you can see
their influence in the grave goods. Trade
connections with Greece are intensifying.
Many of the coffins at Saqqara are made
from expensive wood brought in from
southern Europe and elsewhere around
the Mediterranean.


Q: Why were they not found in pyramids?
A: Death has become a big business.
Discoveries like this tell us about how
you administer a cemetery and run the
business of death. Priests and undertakers
at Saqqara are selling everything from
mummification services to burial plots.
The ideology of death had shifted. People
weren't focused on the size of their tomb;
they were happy to be buried in a sacred
precinct and a nice coffin.


Cell culture meat wins approval 


BIOTECHNOLOGY | California-based food 
startup Eat Just will be the first to market 
meat grown from cultured animal cells, the 
company announced last week. Following 
approval last month from the Singapore 
Food Agency, the company plans to sell its 
cultured chicken nuggets at an unnamed 
restaurant. The meat is produced by 
extracting muscle cells from a live chicken 
and growing them in large quantities in a 
nutrient-rich bioreactor. Proponents argue 
that cultured meat is more humane than 
farmed meat and could reduce global greenhouse
gas emissions, 14.5% of which come
from livestock. 
Tweaking genes with CRISPR
or viruses fixes blood disorders 


People with sickle cell disease or beta-thalassemia could 
be freed of debilitating symptoms for a lifetime 


By Jocelyn Kaiser 


It is a double milestone: new evidence that
cures are possible for many people born
with sickle cell disease and another serious
blood disorder, beta-thalassemia, and
a first for the genome editor CRISPR.
Two strategies for directly fixing mal- 
functioning blood cells have dramatically 
improved the health of a handful of people 
with these genetic diseases, teams reported 
online on 5 December in studies in The New 
England Journal of Medicine (NEJM) and at 
the American Society of Hematology (ASH) 
meeting. One relies on CRISPR, marking the 
first inherited disease clearly helped by the 
powerful tool created just 8 years ago. And 
both treatments are among a wave of genetic 
strategies poised to expand who can get du- 
rable relief from the blood disorders. The 
only current cure, a bone marrow transplant, 
is risky, and matched donors are often scarce. 
The novel genetic treatments have many of 
the same safety issues as those transplants,
and may also be extraordinarily expensive, 
but there is hope those risks can be elimi- 
nated and the costs pared down. “This is an 
amazing time, and it’s exciting because it’s 
happening all at once,” says hematologist
Alexis Thompson of Northwestern Univer- 
sity, who with a company called Bluebird Bio 
continues to test yet another genetic strategy 
that first demonstrated a sickle cell fix several 
years ago. 

People born with sickle cell disease have 
mutations in their two copies of a gene for 
hemoglobin, the oxygen-carrying protein in 
red blood cells. The altered proteins stiffen 
the normally flexible cells into a sickle 
shape. The cells can clog blood vessels, trig- 
gering severe pain and raising the risk of or- 
gan damage and strokes. Sickle cell disease 
is among the most common inherited dis- 
eases, affecting 100,000 Black people in the 
United States alone. (The sickling mutations 
became widespread in Africa, as one copy 
protects blood cells from malaria parasites.)

Victoria Gray (right), shown with researcher Haydar
Frangoul, was the first patient to be treated with
the gene-editing tool CRISPR for sickle cell disease.

People with beta-thalassemia make little 
or no functioning hemoglobin because of 
mutations that affect the same subunit of the 
protein as in sickle cell. About 60,000 babies 
are born each year globally with symptoms of 
the disease, largely of Mediterranean, Middle 
Eastern, and South Asian ancestry. Blood 
transfusions are standard treatment for both 
diseases, relieving the severe anemia they 
can cause, and drugs can somewhat reduce 
the debilitating “crises” that often hospitalize 
sickle cell patients. 

In the two new treatments, investigators 
have tinkered with genes to counter the mal- 
functioning hemoglobin. They remove a pa- 
tient’s blood stem cells and, in the lab, disable 
a genetic switch called BCL11A that shuts
off the gene for a fetal form of hemoglobin 
early in life. The patient then receives chemo- 
therapy to wipe out their diseased cells, and 
the altered stem cells are infused. With the 
fetal gene now active, the fetal protein re- 
stores missing hemoglobin in thalassemia. 
In sickle cell disease it replaces some of the 
flawed adult hemoglobin, and also blocks any 
that remains from forming sticky polymers. 

“It’s enough to dilute the effect,” says
Samarth Kulkarni, CEO of CRISPR Thera- 
peutics, which partnered with Vertex Phar- 
maceuticals on using the genome editor. 
The companies engineered CRISPR’s DNA- 
cutting enzyme and “guide RNA” to home in 
on and break the BCL11A gene. A team led
by gene therapy researcher David Williams of 
Boston Children’s Hospital achieved the same 
goal with a harmless virus that pasted into 
the genomes of harvested blood stem cells a 
stretch of DNA encoding RNA that silences 
the fetal hemoglobin off switch. 

Patients treated in both trials have begun 
to make sufficiently high levels of fetal hemo- 
globin and no longer have sickle cell crises or, 
except in one case, a need for transfusions. In 
one NEJM paper, the Boston Children’s team 
described the success of its virus gene ther- 
apy in six sickle cell patients treated for at 
least 6 months. They include a teenager who 
can now swim without pain, and a young 
man who once needed transfusions but has 
gone without them for nearly 2.5 years, says 
Erica Esrick of Boston Children’s. “He feels 
perfectly normal.” 

The first sickle cell patient to receive 
CRISPR, 17 months ago, a Mississippi mother 
of four named Victoria Gray, has called the re- 
sults “wonderful.” “We have ameliorated the 
symptoms,” says Haydar Frangoul, a hemato- 
logist at the Sarah Cannon Research Insti- 
tute who treated Gray as part of the CRISPR 
trial. A beta-thalassemia patient treated with
the genome editor 22 months ago is also 
doing well, the companies report in NEJM. 
Frangoul described positive results for 10 pa- 
tients with these diseases at the ASH meeting. 

The CRISPR results “are really very impressive,”
says Boston Children’s stem cell
biologist Stuart Orkin, whose lab discovered
the BCL11A switch that led to both trials. (He
is not involved with either.) 

The results are comparable to those of the 
Bluebird strategy, which relies on adding a 
gene for an adult hemoglobin that has been 
tweaked so it reduces polymerization of the 
sickling form. Among 32 sickle cell disease 
patients who received the treatment within 
the past 3 years, 19 who previously experi- 
enced severe pain crises have had none for 
6 months or more, Thompson reported at the 
ASH meeting. The Bluebird treatment was 
approved in Europe in 2019 for certain beta- 
thalassemia patients, and the firm expects to 
seek U.S. approval for both diseases in the 
next few years. Bluebird Chief Scientific Of- 
ficer Philip Gregory says the long-term data 
for the firm’s methods is an advantage over 
other approaches. “We've set a very high bar.” 

Others who treat these diseases say it’s too 
early to crown a specific genetic treatment 
the winner. For example, reversing the fetal 
hemoglobin off switch, as the new CRISPR 
and RNA-based gene therapy strategies do, 
allows blood cells to make natural levels of 
the protein, unlike Bluebird’s approach. And 
although a virus-carrying gene can land in 
the wrong place and trigger cancer, CRISPR 
could in theory similarly make harmful off- 
target edits. “We need long-term follow-up” 
for all the strategies, says the National Insti- 
tutes of Health’s (NIH’s) John Tisdale, a co- 
leader of the Bluebird study. 

These genetic treatments, as currently 
practiced, seem unlikely to help the many 
patients worldwide who don’t have access to 
sophisticated health care. Bluebird expects to 
charge $1.8 million for its beta-thalassemia 
product in Europe—a sum it bases on a pa- 
tient’s gains in life span and quality of life— 
and the other genetic treatments are likely to 
be similarly expensive. 

Bluebird and other groups are exploring 
whether antibodies, instead of harsh chemo- 
therapy, can wipe out a patient’s mutant cells. 
In a bolder effort to reduce costs and increase 
safety, NIH and the Bill & Melinda Gates 
Foundation last year announced a plan to 
each put at least $100 million into developing 
technologies that would modify blood stem 
cells directly in a patient’s blood marrow. “It’s 
a big hairy goal, but it’s an engineering challenge,”
says gene therapy researcher Donald
Kohn of the University of California, Los An- 
geles, who leads another sickle cell treatment 
trial. “We’ll get there.” 
Spare a thought for the teeming
ecosystem beneath your feet 


Global review of soil biodiversity calls for protecting 
often overlooked subterranean life 


By Elizabeth Pennisi 


Reach down and scoop up some
soil. Cupped in your hands may be 
5000 different kinds of creatures— 
and as many individual cells as 
there are humans on the globe. That 
random handful might hold micro- 
scopic fungi, decomposing plant matter, a 
whisker-size nematode munching on the 
fungi, and a predatory, pinhead-size mite 
about to pounce on the nematode. One 
bacterium may fend off another with a po- 
tent antibiotic. It’s a whole world of often 
overlooked biodiversity. 

Last week, on the eve of World Soil Day, 
the Food and Agriculture Organization of 
the United Nations released its first ever 
global assessment of the biodiversity in this 
underground world. Some 300 experts have 
pooled their knowledge and data to describe 
the diversity of these organisms, the roles 
they play in both natural and agricultural 
environments, and the threats they face. 








A complex dance among microbes, plants, and 
arthropods and other animals sustains soil ecosystems. 
“The organisms below ground are arguably
just as important, if not more important,
than what’s above ground,” says Noah
Fierer, a soil ecologist at the University of 
Colorado, Boulder, who did not contribute 
to the report. It details how life in the soil 
boosts crop growth and purifies soil and 
water. Together with plant root systems, 
soil organisms store more carbon, poten- 
tially for longer, than the aboveground 
parts of trees do. “Depending on how we 
handle soil, it could become a help or a 
burden to face the crisis of biodiversity or 
climate change,” says Francisco Pugnaire, 
a soil and plant ecologist at the Spanish 
National Research Council’s Experimental 
Station of Arid Zones. 

Yet with each pass of the bulldozer or 
tractor, each forest fire, each oil spill, even 
the constant traffic of hikers along a popu- 
lar trail, more and more soil organisms are 
being killed off. By compiling research on 
these subterranean ecosystems and how 
they affect visible ones, the report’s au- 
thors hope to convince scientists, policy- 
makers, and the general public to take 
steps to slow this loss. 

“You just can’t have a Mars-like soil and 
expect to maintain the food supply and 
forests,” warns Diana Wall, an ecologist at 
Colorado State University who contributed 
to the report. Current conservation efforts 
are not helping much, she adds. For exam- 
ple, soil biodiversity hot spots aren’t neces- 
sarily in the same place as the biodiversity 
hot spots that conservationists focus on. 
“We are managing [conservation] by what 
we see above ground, which doesn’t nec- 
essarily match what’s below ground.” In 
contrast, Fierer says, “If you preserve 
the soil, you will likely preserve the 
whole ecosystem.” 

Soil is a mix of organic material, miner- 
als, gases, and other components that pro- 
vide the substrate for plants to grow. About 
40% of all animals find food, shelter, or ref- 
uge in soil during part of their life cycle. 

Scientists have mostly focused on the 
largest and smallest soil creatures. For 
centuries, natural historians have ob- 
served the moles, earthworms, ants, and 
termites that chew, wiggle, and dig their 
way among soil’s particles, feasting on decaying
leaves and other debris or on each
other. Those ecosystem engineers aerate 
the soil and create underground passage- 
ways that make soil more hospitable for 
other life. And over the past few decades, 
microbiologists sequencing soil DNA have 
discovered an astonishing diversity of bac- 
teria and fungi, which process that litter 
into organic material. 

But in between the scales of macro- 
scopic animals and microbes lie thousands 
of long-overlooked tiny creatures—the 
micro- and meso-fauna. Microscopic pro- 
tists, nematodes, and tardigrades inhabit 
the watery films around soil particles. 
Slightly larger animals up to 2 millimeters 
in size, such as mites, springtails, and in- 
sect larvae, live in the airy pores between 
those particles, helping make soil one of 
the most diverse habitats on Earth. “How 
little we know [about these creatures] is a 
bit overwhelming,” Fierer says. 

This diversity creates a rich, complex 
ecosystem that boosts crop growth, breaks 
down pollutants, and can serve as a nearly 
inexhaustible sink for carbon. Some soil or- 
ganisms promote plant diversity and many 
have yielded important compounds, from 
antibiotics to natural pesticides. “Without 
soil organisms and the activities they carry 
out, it would be impossible for other or- 
ganisms to survive,” says Stephen Wood, a 
soil ecologist at the Nature Conservancy. 

Hidden below ground, these ecosystems 
seemed immune to surface disturbance, 
Wood says. “For a long time, soil scien- 
tists thought soil microorganisms were 





so well spread around the world that land 
management would not harm them,” he 
explains. “We now know that soil microorganisms
can be very specific to very specific
habitats and species,” habitats that are
disappearing as farms and cities expand. 

The report lists a dozen human activities 
taking a major toll on soil organisms. They 
include deforestation, intense agriculture, 
acidification due to pollutants, salinization 
from improper irrigation, soil compaction, 
surface sealing, fire, and erosion. “If you 
pave over a site, you are sealing off an en- 
tire belowground ecosystem,” Fierer says. 
“And that’s happening all over the globe.” 

A few governments and companies are 
making some progress. Several states are 
considering legislation to help protect 
soils. In China, the Agricultural Green 
Development program works to conserve 
soil by avoiding tilling and by interweav- 
ing different crops to preserve biodiversity. 
“Most organizations want to protect soil 
biodiversity as a means to an end,” Wood 
points out, to benefit people or above- 
ground ecosystems. 

Some researchers hope the report will en- 
courage protecting soil organisms for their 
own sake. “Soil biodiversity is huge, and we 
must not destroy it without knowing what 
potential there is for improving sustainability,”
says Mary Scholes, a biogeochemist at
the University of the Witwatersrand. 

Fierer thinks the new assessment will 
also awaken a sense of wonder. “My hope is 
people will look at this document and say, 
‘Huh, I never thought about [soil organisms]
and all the things they do for me.’”





The soil is alive


Organisms including ants and termites physically structure soil, while leaf litter creatures and microbes break
down organic matter. All these organisms, including nematodes, mites, and springtails, are linked in an intricate
food web (arrows show the flow of nutrients).

[Figure labels: Ecosystem engineers (organisms that aerate and alter the soil structure); Litter transformers (micro-, meso-, and macroarthropods); organic matter]
PEOPLE


HIV researcher 
named to lead 


battered CDC 


Biden also picks California 
attorney general for health 
and human services 


By Jocelyn Kaiser 


President-elect Joe Biden moved this
week to fill two top health positions.
California Attorney General Xavier
Becerra will be nominated as secretary
of health and human services,
and HIV/AIDS researcher Rochelle
Walensky of Harvard Medical School and
Massachusetts General Hospital has been selected
to direct the Centers for Disease Control
and Prevention (CDC).

In Congress, Becerra served on a House 
of Representatives committee that oversaw 
Medicare and Medicaid. As California attor- 
ney general, he has led efforts to defend the 
Affordable Care Act and women’s reproduc- 
tive rights. 

Biden’s selection of Walensky drew praise. 
“She’s an outstanding choice. It’s a bright fu- 
ture for the CDC,” says James Curran, dean of 
the Rollins School of Public Health at Emory 
University, a former director of CDC’s HIV/ 
AIDS division. A physician and epidemiolo- 
gist, Walensky’s specialties include promoting 
HIV testing and modeling the cost effective- 
ness of using drugs to prevent HIV infections 
in high-risk groups. This year, she has advo- 
cated strong action to control and monitor 
the spread of the pandemic coronavirus. 

Walensky is “an incredible leader in infec- 
tious diseases globally and particularly in 
HIV,” as well as a “clear communicator” on 
COVID-19, says Jen Kates, director of global 
health and HIV policy at the Henry J. Kaiser 
Family Foundation. 

CDC has struggled to issue clear public 
health messages about COVID-19 amid pres- 
sure from the Trump White House to down- 
play the pandemic. Walensky will need to 
help oversee an unprecedented vaccination 
campaign, and “she will face enormous chal- 
lenges in upgrading the scientific capabilities 
and the credibility at CDC,” says Barry Bloom,
a researcher and former dean of the Harvard 
T.H. Chan School of Public Health. “I have 
no doubt she will provide the leadership re- 
quired,” he adds. 
Tasmanian devils claw their way
back from extinction 


Genomes show spread of deadly cancer is slowing 


By Elizabeth Pennisi 


For decades a ghastly facial cancer
has been decimating Tasmanian devils.
Spreading from animal to animal
when the stocky, raccoon-size marsupials
bite each other, the transmissible
cancer has killed up to 80% of the 
devils in Tasmania, their only home for mil- 
lennia. Some researchers saw extinction as 
inevitable. Now, a new study, reported on 
p. 1293, suggests the remaining 15,000 dev- 
ils have reached a détente with the cancer. 
Until recently it was spreading exponen- 
tially, like the pandemic coronavirus among 
humans in many parts of the world. But ge- 
neticists calculate that each infected devil 
now transmits tumor cells to just one—or 
fewer—other devils. That could mean the 
disease may disappear over time. 
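
The arithmetic behind that inference can be sketched in a few lines of Python. This is only an illustration of the general idea, not the method used in the study: the function name and the transmission values of 2.0 and 0.5 are hypothetical, chosen to contrast a number above one with a number below one.

```python
def expected_new_infections(r, generations, initial=1.0):
    """Expected newly infected animals in each transmission generation.

    Assumes a very simple model: every infected animal passes the disease
    to `r` others on average, so generation n has initial * r**n new cases.
    """
    return [initial * r**n for n in range(generations + 1)]


# Hypothetical values, not taken from the study.
growing = expected_new_infections(r=2.0, generations=5)    # above one: doubles each generation
shrinking = expected_new_infections(r=0.5, generations=5)  # below one: halves each generation

print(growing)
print(shrinking)
```

With a value above one, each generation of cases is larger than the last; with a value below one, each generation is smaller, and the chain of transmission eventually dies out.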

“It is a promising sign for the future,” says 
Gregory Woods, an immunologist at the 
University of Tasmania, Hobart, who was 
not involved with the work. Along with two 
other recent studies, the findings hint that 
changes in devil behavior—and possibly the 
emergence of less virulent tumor cells—may 
be taming the cancer’s spread, and that des- 
perate efforts to breed the devils in captivity 
may not be needed. “[This] deeper under- 
standing of the links between host behavior 
and infectious disease may help reveal new 
insights that can help both devils and other 
wildlife facing emerging disease threats,” 
says Vanessa Ezenwa, a disease ecologist at
the University of Georgia, Athens. 

Transmissible cancers are rare in mam- 
mals, and devils—whose nocturnal screams 
and growls earned them their name—are 
mostly solitary. But starting in 1996, re- 
searchers began to notice more and more 
devils with tumors. Sick animals infected 
others with cancer cells during mating sea- 
son fights and scuffles over scavenged car- 
casses, triggering rapid spread of devil facial 
tumor disease (DFTD). Tens of thousands 
of devils died, and conservationists estab- 
lished captive breeding programs to create 
a reserve population for reintroduction. 

The dynamics of the disease are complex, 
according to a recent study led by cancer 
geneticist Elizabeth Murchison of the Uni- 
versity of Cambridge. Her team genetically 
analyzed more than 600 tumor samples col- 
lected between 2003 and 2018 and found 
five genomic versions, three of them wide- 
spread, with some devils contracting mul- 
tiple types. That complexity could hamper 
efforts to develop vaccines to conquer the 
cancer, according to their 24 November 
study in PLOS Biology. 

To further investigate the spread of the 
rogue tumor cells, Washington State Univer- 
sity, Pullman, geneticist Andrew Storfer and 
his graduate student Austin Patton exam- 
ined differences in tumor genomes through 
time, an approach routinely used to trace 
the spread of viruses including SARS-CoV-2, 
the pandemic coronavirus. A mammalian 
tumor genome is much larger than that of a 
virus, so Patton and colleagues had to work 
out ways to analyze their data, gleaned from 
51 tumors collected from 2003 to 2018. 

Biting during mating can transmit a deadly cancer 
from one Tasmanian devil to another. 

Focusing on 28 genes that seemed to be 
evolving at a consistent rate, they traced 
how specific mutations spread through 
the tumor samples over time. That enabled 
them to infer the rate at which the cancer 
itself was spreading among devils. “The ap- 
plication of these methods for transmissible 
cancer is very clever,” says Michael Metzger, 
a molecular biologist at the Pacific North- 
west Research Institute. 

At the disease’s peak in the early 2000s, 
each infected devil spread the disease to at 
least 3.5 others, the team reports in Science. 
But transmission has slowed recently, with 
some infected animals not passing DFTD 
on at all. The reduced density of devils accounts 
for much of the decline, Patton suggests, 
as animals come into contact with 
fewer of their fellows. Remaining devils may 
also have better immune systems or altered 
behavior, speculates Patton, now a postdoc 
at the University of California, Berkeley. 

A study out on 9 December in the Pro- 
ceedings of the Royal Society B supports 
the idea that an animal’s behavior can slow 
transmission. Disease ecologist Rodrigo 
Hamede and behavioral ecologist David 
Hamilton from the University of Tasmania, 
Sandy Bay, did devil contact tracing: For 
6 months they put radio collars on 22 dev- 
ils that revealed when an animal came into 
close contact with another. The tracking 
showed that once infected, even dominant, 
aggressive devils withdrew from others as 
they became sicker. These individuals were 
“superspreaders” only early in the mating 
season, Hamilton, Hamede, and their col- 
leagues report. “The fact that they behave 
in this way is likely to have a big impact on 
disease dynamics,” Hamilton says. 

The Science study authors argue against 
plans to introduce captive-bred devils into 
remaining wild populations. Beefing up 
devil populations may increase their density 
and rev up transmission again, and captive- 
bred animals may lack resistance built up in 
wild populations, Storfer speculates. 

Though this week’s news is good, “devils 
are still not out of the woods,” warns con- 
servationist Max Jackson of Aussie Ark, 
which helps breed captive devils. Indeed, 
researchers detected a second transmissible 
facial cancer in devils in 2014. But the new 
findings offer hope, Hamilton says. “It looks 
extremely unlikely that we'll be losing them 
any time soon.” 
Scientists rally around plan for fusion power plant 


Road map calls for Department of Energy to fund applied as well as basic research 


By Adrian Cho 


U.S. fusion scientists, notorious for squabbling over which projects 
to fund with their field’s limited budget, have coalesced around an 
audacious goal. A 10-year plan presented last week to the federal 
Fusion Energy Sciences Advisory Committee is the first since the 
community tried to formulate such a road map in 2014 and failed 
spectacularly. It calls for the Department of Energy (DOE), the main 
sponsor of U.S. fusion research, to prepare to build a prototype 
power plant in the 2040s that would produce carbon-free electricity 
by harnessing the nuclear process that powers the Sun. 

The plan formalizes a goal set out 2 years ago by the National 
Academies of Sciences, Engineering, and Medicine (Science, 
21 December 2018, p. 1343) and embraced in a March report from a 
15-month-long fusion community planning process. It also represents 
a subtle but crucial shift from the basic research that officials in 
DOE’s Office of Science have favored. “The community urgently wants 
to move forward with fusion on a time scale that can impact climate 
change,” says Troy Carter, a fusion physicist at the University of 
California, Los Angeles, who chaired the planning committee. “We 
have to get started.” 

Fusion scientists and DOE officials strived to avoid the sort of 
meltdown they suffered during their last planning exercise. Six years 
ago, the fractious community was already reeling from budget cuts 
that forced DOE’s Fusion Energy Sciences (FES) program to shutter 
one of three major experiments. Then, the associate director for FES 
decided to write the plan himself, with limited input. Many 
researchers rejected the road map (Science, 19 December 2014, 
p. 1436). 

This time, DOE wants no infighting. “We’ve been told in no uncertain 
terms that either you guys get in line, or you’re going to get 
nothing,” says Nathan Howard, a fusion physicist at the Massachusetts 
Institute of Technology. For the first time, FES leaders let 
researchers hash out consensus in a series of workshops and meetings. 
Howard and other leaders of that process used anonymous polling and 
even hired a facilitator to ensure the “loudest voices in the room” 
couldn’t dominate deliberations. 

The process was also comprehensive, says Carolyn Kuranz, a plasma 
physicist at the University of Michigan, Ann Arbor. FES mainly funds 
research on magnetically confined fusion, in which an ionized gas or 
plasma is squeezed and heated until atomic nuclei fuse and release 
energy. But it also supports smaller efforts in plasma physics, such 
as using high-power lasers to re-create plasmas like those in stars. 
The consensus building did not neglect them. “This was the first time 
we included the whole portfolio and the entire community,” Kuranz 
says. 

The plan that emerged does not call for a crash effort to build the 
prototype power plant. During the next decade, fusion researchers 
around the world will likely have their hands full completing and 
running ITER, the international fusion reactor under construction in 
southern France. ITER, a huge doughnut-shaped device called a 
tokamak, aims to show in the late 2030s that fusion can produce more 
energy than goes into heating and squeezing the plasma. 

ITER will teach valuable lessons about a “burning plasma,” 
researchers say. But they add that its cost of more than $20 billion 
is far too steep for an actual power plant. So, after ITER, U.S. 
fusion researchers want to build a much smaller, cheaper power plant, 
leveraging recent advances such as supercomputer simulations of 
entire tokamaks, 3D printing, and magnet coils made of 
high-temperature superconductors. 

The new fusion road map identifies technological gaps and nearer-term 
facilities to fill them (see partial list, below). “By identifying 
[a power plant] as a goal, that can trigger more research in those 
areas that support that mission,” says Stephanie Diem, a fusion 
physicist at the University of Wisconsin, Madison. For example, in a 
fusion power plant a barrage of energetic neutrons would degrade 
materials, so the report calls for developing a 
particle-accelerator-based neutron source to test new ones. 

Such technology development pushes a sensitive boundary for the 
fusion program. Fusion investigators have long complained that DOE’s 
Office of Science has limited them to basic research. Now, DOE 
leaders are more receptive to a practical approach, says James Van 
Dam, DOE’s associate director for FES. “There’s been much more 
openness and interest in fusion moving ahead.” 

To realize their ambitions, fusion scientists will need more funding 
from Congress. The planning committee considered three scenarios: 
flat budgets, increases of 2% per year, and unconstrained budgets. 
Only the most generous scenario would allow DOE to build new 
facilities, the report says. FES’s annual budget is now $671 million, 
including $247 million for ITER. Tighter budgets might strain the 
newfound consensus. Plasma physicists want several new facilities, 
such as one to simulate the solar wind. But without a funding boost, 
they won’t even be able to build a project DOE has already said it 
wants: the Matter in Extreme Conditions Upgrade, which would improve 
a petawatt laser at the SLAC National Accelerator Laboratory to 
create energetic plasmas so they can be probed with the lab’s x-ray 
laser. 

No matter how things play out, the fusion plan expresses the will of 
younger scientists who led the community exercise, says Scott 
Baalrud, a plasma theorist at the University of Iowa. “People don’t 
get into this career just to study the science that may one day, long 
after they’re dead, lead to a fusion reactor,” he says. “They want to 
get going and change the world.” 


Fusion wish list 

U.S. researchers have agreed on the need for projects that would 
aid a future power plant (gray) and advance basic plasma science. 
However, funding limits could curtail plans. 

PROJECT                                      FLAT BUDGETS              2% INCREASES                 UNCONSTRAINED 
Neutron source to test materials for         Yes, but highly delayed   Yes, but delayed             Yes 
  fusion power plant 
[illegible] integrated systems for           No                        [illegible] highly delayed   Yes 
  fusion power plant 
[illegible] “blanket” that would surround    No                        No                           Yes 
  reactor and absorb neutrons 
Matter in Extreme Conditions Upgrade         No, but develop further   No, but develop further      Yes 
Solar wind facility                          No                        No                           Yes 
Multipetawatt laser                          No                        No                           Yes 
Europe ramps up defense R&D 


A €7 billion fund is launched to bolster meager 
military spending by EU nations 


By Nicholas Wallace 


This summer, in a leafy, wooded area near Utrecht, the Netherlands, 
scientists were testing out battlefield haute couture: adaptive 
camouflage. Researchers with the Netherlands Organisation for Applied 
Scientific Research mounted a swath of fabric on a stand and watched 
as its pattern shifted to match the greens and browns of the foliage. 
Cameras connected to the fabric picked up the scenery and hundreds of 
embedded light-emitting diodes mimicked it, like the skin of a 
chameleon. The team is testing other materials to weave into the 
futuristic camouflage, including polymers that absorb body heat and 
radio waves, making soldiers harder to detect with thermal imagers 
and radars. 

Just as striking as the fabrics is the project’s funding source: the 
European Union, better known for trade rules and farm subsidies than 
military maneuvers. The camouflage work is part of a Swedish-led, 
six-country project that received a €2.6 million grant from the 
union’s Preparatory Action on Defence Research (PADR). The 3-year 
fund, worth €90 million, also supports research in artificial 
intelligence (AI) for bomb detection, laser weapons, railguns, and 
drones. It is a sign of much bigger things to come: Next year, PADR 
will be rolled into the new European Defense Fund (EDF), with a 
budget of €7 billion over 7 years, split between early-stage research 
and late-stage development. 

Drones will be one of the research topics supported 
by a growing European Defense Fund. 

That’s tiny compared with the $80 billion per year the United States 
spends on defense R&D. And it’s even small compared with the combined 
€5 billion or so spent on defense research each year by EU nations. 
But the European Union, which has no military resources of its own, 
hopes the EDF, by topping up joint national investments, will 
encourage its members to strengthen their modest defense 
capabilities. For European researchers, the spending is opening new 
opportunities—and stirring some qualms. 

Going great guns 
Most EU nations, with the exception of France, spend very little on defense R&D. 
EU officials hope a new defense R&D fund, worth about €1 billion a year, will 
boost national spending. 
[Chart: Defense R&D spending (€ billions), 2018] 

European governments slashed defense budgets in the 1990s, believing 
the danger of major conflict in Europe had ended with the Cold War. 
“It was a bit of a paradox, because of course we had the Balkan 
wars,” says Zdzisław Krasnodębski, a member of the European 
Parliament for Poland’s governing Law and Justice Party who steered 
the EDF through Parliament. But recent events—particularly Russia’s 
annexation of Crimea in 2014 and the subsequent war in Ukraine—have 
changed many minds, he says. 

At the same time, the United States is withdrawing from its role as 
guarantor of European security, says Julia Muravska, a researcher at 
the RAND Corporation, a global defense policy think tank. The Obama 
administration shifted U.S. defense resources to the Far East, and 
both it and the Trump administration have announced troop withdrawals 
from Europe. Successive U.S. presidents have urged the 21 EU 
countries that are also North Atlantic Treaty Organization members to 
honor commitments to spend at least 2% of their gross domestic 
product on defense. But only three actually do: Latvia, Estonia, and 
Greece. 

To address these gaps, European leaders are discussing ways to pool 
military resources, and French President Emmanuel Macron has even 
called for “a true European army.” The EDF aims to be another 
stimulant for collaboration. Every EDF project must include 
participants from at least three nations, and mandatory cofinancing 
for late-stage development projects will give the budget a “lever 
effect,” says Frederic Mauro, a Brussels-based lawyer specializing in 
defense. Even though the EDF is minuscule compared with U.S. 
spending, it “can make a huge difference at the European scale,” he 
says. 

One unanswered question is whether researchers in non-EU countries 
such as the United Kingdom and Switzerland can apply for EDF funding, 
and on what terms. If legislators allow their participation, the 
rules will likely be stricter than for Horizon 2020, the European 
Union’s civilian research program, which already includes non-EU 
nations like Switzerland and Israel. Jean-Francois Ripoche, R&D chief 
for the European Defence Agency, which runs PADR, says the argument 
boils down to whether a foreign firm or institution can reliably 
contribute to EU military projects without interference from its home 
country. “In the defense business, it’s hard to do without talking to 
your own government,” he says. 

PADR funding has gone to a mix of re- 
search institutes, including Germany’s 
Fraunhofer Society; engineering companies 
such as French giant Thales; weaponsmak- 
ers like MBDA; and smaller businesses, such 
as Space Applications Services, a Belgian 
research firm. Universities also participate: 
For example, the University of Siena leads 
another camouflage project. 

Future EDF research topics will be speci- 
fied in annual calls run by the European 
Commission, the EU executive branch, and 
approved by a committee of national del- 
egates. AI will be a big topic, Ripoche says. 
He says EDF funding will also go to new ma- 
terials, such as discreet metamaterial anten- 
nas that can be engineered into the surfaces 
of vehicles and weapons. Muravska says she 
expects “a healthy take-up” in the EDF by Eu- 
ropean academic researchers, “provided they 
are aware of it.” 

But some scientists are uneasy. “Military 
research can feed into technologies which are 
then exported to countries with poor human 
rights records,” says Stuart Parkinson, direc- 
tor of Scientists for Global Responsibility, a 
U.K. advocacy organization. He adds that nu- 
clear arms races could be stoked by develop- 
ment of ostensibly nonnuclear technologies, 
such as hypersonic weapons. 

Reiner Braun, a board member of the 
Berlin-based International Network of Engi- 
neers and Scientists for Global Responsibility, 
says he wishes EDF money could be used for 
research into understanding and defending 
against poverty, climate change, and disease, 
which can also lead to conflict. “We need 
much more research for peaceful activities 
for supporting the sustainable development 
goals, for climate research, and for many 
other purposes, including the fight against 
the pandemic.” 

European security should come through 
dialogue with Russia and multilateral disar- 
mament, Parkinson says. But he recognizes 
that a weapons-free world remains a far- 
off dream. “I’m not saying we should throw 
the doors open and throw our weapons on 
the floor.” 


Nicholas Wallace is a journalist in Brussels. 
Ecologists push for more reliable research 


Psychology’s replication crisis inspires efforts to expand samples 
and stick to a research plan 


By Cathleen O’Grady 


Ecologists love to study blue tits. The 
birds readily nest in boxes in the wild 
and have striking plumage that seems 
ideal for testing ideas about the evo- 
lutionary point of the ornamentation. 
Dozens of studies have reported that 
male coloring is substantially different from 
that of females, that females choose mates 
based on differences in that coloring, and 
that male plumage is a signal of mate quality. 

But Tim Parker, an ecologist at Whitman 
College, wasn’t so sure. In a 2013 meta- 
analysis of 48 studies on blue tit plumage, 
Parker found many researchers had cherry- 
picked the strongest findings from data 
they had sliced and diced. They had worked 
backward from results to form hypotheses 
that fit the data. And reams of boring, nega- 
tive results were missing from the published 
picture. There was no reason to think these 
problems were limited to blue tits, Parker 
says: “I just became convinced that there was 
a lot of unreliable stuff out there.” 

Parker soon found an ally in Shinichi 
Nakagawa, an ecologist at the University 
of New South Wales with similar concerns. 
“It’s an existential crisis for us,” Nakagawa 
says. The two began to publish on the is- 
sue and gathered more collaborators. That 
has culminated with the launch last week 
of the Society for Open, Reliable, and Trans- 
parent Ecology and Evolutionary Biology 
(SORTEE), dedicated to connecting eco- 
logists who want to add rigor to their field. 
SORTEE draws inspiration from the Society 
for the Improvement of Psychological Sci- 
ence (SIPS)—another discipline that has 
wrestled with reliability. SORTEE plans to 
host satellite events at ecology conferences 
and, eventually, its own meetings. 

Although SORTEE’s agenda will be set by 
its members, Parker says that, like SIPS, it 
could offer statistics training, build collabo- 
rations, and support metaresearch on the 
health of the discipline. Yolanda Wiersma, 
a landscape ecologist at the Memorial Uni- 
versity of Newfoundland who is not involved 
with SORTEE, is eager to see whether the so- 
ciety makes a difference. Research credibility 
is something “we haven’t wrapped our heads 
around completely as ecologists,” she says. 

Ecology suffers from many of the same 
underlying problems as psychology. Surveys 
of the ecology literature have found that 
small sample sizes are common, often driven 
by high cost or limited access to a species 
or other study system. In landscape eco- 
logy, each landscape is unique, meaning the 
sample size is one, Wiersma says. “There is 
one Yellowstone park,” she says. “There’s one 
Lake District.” Small samples lead to erratic 
results that sometimes miss the effects re- 
searchers are looking for and other times hit 
on noise that looks like a real signal. 

Worsening those problems are “questionable 
research practices,” says Fiona Fidler, 
a metascientist at the University of Mel- 
bourne. In a 2018 study published in PLOS 
ONE, Parker, Fidler, and colleagues reported 
on a survey of more than 800 ecologists and 
evolutionary biologists. About half of the re- 
spondents said they sometimes presented 
unexpected findings as if they confirmed a 
hypothesis they’d had all along, and about 
two-thirds said they sometimes reported 
only significant results, leaving out negative 
ones. Together, these forces mean a literature 
overflowing with potentially dubious results, 
Parker says. It’s a “house of cards.” 

Flaws in studies about blue 
tit plumage motivated a push 
for more rigor in ecology. 

But unlike psychology, in which research- 
ers have tried to replicate famous studies and 
failed in about half the cases, ecology has 
no smoking gun. A 2019 PeerJ study found 
only 11 replication studies among nearly 
40,000 ecology and evolutionary biology papers—and 
only four of these 11 studies managed 
to replicate the original finding. It’s hard to 
replicate ecology studies, Parker says, because 
it often entails expensive and difficult data 
gathering in remote places or over long time 
frames. And ecosystems are so complex that 
any number of variables could affect the out- 
come of a repeat experiment—like the age of 
the organisms in the study, the temperatures 
at the time, or the presence or absence of pol- 
lutants. “No man can step into the same river 
twice because it’s not the same man and it’s 
not the same river,” says Phillip Williamson, 
an ecologist at the University of East Anglia 
who has criticized a high-profile effort to rep- 
licate ocean acidification research. 

Yet Williamson doesn’t think ecology as 
a whole is at risk just because some experi- 
ments fail to replicate. “Biology isn’t physics,” 
he says. “I think that the consensus of science 
gets there eventually.” Parker takes a harder 
line. “If we don’t expect anything to replicate, 
why do we bother doing any of this?” he asks. 

Even before they set up SORTEE, Parker 
and his corevolutionaries were pushing for 
change. They worked with journal editors 
to create checklists for details that papers 
should include—like whether researchers 
were blinded to the conditions of different 
subject groups. They’ve also set up a preprint 
server that Nakagawa hopes will help 
preserve results that never make it into jour- 
nals. Julia Jones, a conservation scientist at 
Bangor University who is not involved with 
SORTEE, is advocating for preregistration, 
which forces a researcher to commit to a data 
collection plan and hypothesis before the 
study begins. Some journals offer registered 
reports—peer-reviewed preregistrations 
with a commitment to publish the results, 
however dull or dazzling. Preregistration 
isn’t always possible, because the vaga- 
ries of fieldwork often force researchers to 
change plans. But she says it can help scien- 
tists avoid the “siren song” of looking for a 
clean story in messy data. 

In April, Jones and her colleagues pub- 
lished the first registered report for the jour- 
nal Conservation Biology. She analyzed extra 
data from a randomized controlled trial in 
Bolivia’s highlands that had already found 
that paying farmers to keep their cattle out 
of rivers did not improve water quality. Jones 
found other interesting behavior changes— 
farmers kept their cattle on their farms 
rather than roaming the forests—but many 
results were statistically insignificant. In a 
normal review process, “we would have been 
forced to cherry-pick and tell a much simpler 
story,” she says. 

Others are working to address the sample 
size problems, by gathering massive amounts 
of data using consistent methods. They hope 
the data sets will make it easier to see which 
findings apply beyond a single ecosystem. 
The U.S. National Ecological Observatory 
Network (NEON), a continentwide program 
of more than 100 heavily instrumented field 
sites, became fully operational in 2019, and 
the first studies drawing on its data are now 
underway. 

The Nutrient Network (NutNet), co- 
founded by University of Minnesota, Twin 
Cities, ecologist Elizabeth Borer, also pulls 
in large amounts of standardized data, to 
explore how changes in nutrients and herbi- 
vores affect grassland plant diversity. Rather 
than building infrastructure like NEON, 
NutNet gets research teams around the 
world to perform the same experiments— 
in return for access to a huge data set. Unify- 
ing experiments is hard, Borer says. For in- 
stance, the team discovered that the fertilizer 
brand Micromax had slightly different micro- 
nutrient mixes on different continents, forc- 
ing researchers to import or mix their own. 

Borer, Wiersma, and Jones are all sym- 
pathetic to SORTEE’s aims—and curious to 
see whether it takes off. Like the systems 
they study, ecologists can be fragmented, 
and developing sound research principles 
sometimes seems impossible, Wiersma 
says. “But I think maybe we could,” she says. 
“We just need to try harder.” 
Mexico’s controversial coronavirus czar 


Hugo López-Gatell Ramírez’s response has 
drawn criticism and praise 


By Rodrigo Pérez Ortega 


There’s hardly a Mexican who doesn’t 
know Hugo López-Gatell Ramírez by 
now. Mexico’s undersecretary of prevention 
and health promotion has sat 
across from reporters at 7 p.m. sharp 
almost every single night since late 
February to update them, and the country, on 
the toll of the coronavirus pandemic. His firm 
demeanor, careful speech, and courteous per- 
sonality have made his televised coronavirus 
press briefings even more popular than those 
of the country’s president. 

But as COVID-19 deaths in Mexico con- 
tinue to soar—surpassed only by the United 
States, Brazil, and India—many have questioned 
López-Gatell Ramírez’s leadership. 
Critics accuse him of undercounting the true 
numbers and mishandling the nation’s re- 
sponse. In early August, the governors of nine 
Mexican states demanded his resignation. 
His defenders, though, say he’s making sound 
decisions based on science and doing the best 
he can with the resources at his disposal. 

López-Gatell Ramírez says the country’s 
chronically underfunded public health sys- 
tem has complicated efforts to track the 
disease and coordinate the response. But he 
insists the government’s approach—which 
includes forgoing widespread testing in 
favor of a more targeted approach—has 
been based on science. Many critics, he 
says, are politically motivated. He’s been 
sleeping 3 to 4 hours a night since the pan- 
demic started, he says, and he regrets the 
disruption to his family life. “The mission 
calls me and until I deliver results—I hope 
favorable—I cannot stop.” 

Fame might be new to López-Gatell 
Ramírez, but pandemics are not. Just 2 years 
after finishing his Ph.D. in epidemiology in 
2006 at Johns Hopkins University, he became 
head of epidemiology at the Mexican health 
ministry. One year later, in 2009, the swine 
flu pandemic struck Mexico, and López-Gatell 
Ramírez helped lead the response. 

The H1N1 virus, which caused panic 
around the world, likely originated on a pig 
farm in Mexico. Mexico City was shut down 
for 2 weeks, and the country scrambled to 
curb the virus’ spread. López-Gatell Ramírez 
says the experience taught him many lessons, 
including the dangers of poor coordination 
among government institutions and “politi- 
cal opportunism” by officials trying to use the 
crisis to their advantage. This time, he says, 
President Andrés Manuel López Obrador has 
given his team decision-making power and 
protected them from political meddling. 

Faced with the new pandemic, 
López-Gatell Ramírez placed his trust 
in the country’s “sentinel” system, 
originally designed for flu outbreaks. 
In contrast to the massive testing and 
contact tracing efforts in many other 
countries, the system relies on a small, 
strategic, nationwide sample of tests, 
which are then used to predict the 
spread of a disease. As a result, even 
though Mexico’s flagship diagnostics 
laboratory—the Institute of Epide- 
miological Diagnosis and Reference 
(InDRE)—developed the world’s first 
polymerase chain reaction protocol to 
detect SARS-CoV-2, the country ranks 
among the world’s lowest in per-capita 
testing. Mexico has performed about 
17 tests per thousand people in total, 
whereas the United States has done 
more than 30 times that many. 

Reliance on the sentinel model 
may have prevented López-Gatell 
Ramírez and his team from making 
accurate predictions. They forecast 
the pandemic would peak around 
8 May, with 4500 daily cases, aided by 
a national voluntary lockdown in ef- 
fect from late March until June. (The 
lockdown was voluntary because many 
Mexicans live in poverty and rely on 
daily work to survive.) Instead, daily 
cases kept climbing, to a peak of 9556 
on 3 August. Since then, the number has 
fluctuated between 3000 and 12,000, and the 
pandemic is worsening. The official death 
count now exceeds 110,000, nearly twice the 
model’s original worst case. 

Researchers, politicians, and national 
and international media have suggested the 
true toll is even greater. As in almost every 
country, not every COVID-19 death is reg- 
istered. But the rift between statistics and 
reality may be particularly wide in Mexico. 
In late November, officials released a report 
of excess deaths from all causes, suggesting 
nearly 156,000 of them could be attributed to 
COVID-19 in people who were never tested. 
The rate of positive tests recently hit 47%, evi- 
dence of a rampaging epidemic. 

Some observers think scarce funding 
was the real reason for the limited testing. 
Alejandro Macias, an infectious disease phy- 
sician and Mexico’s former H1N1 czar, doubts 
López-Gatell Ramírez truly believed the ap- 
proach would be successful. “He got trapped 
in a system in which the InDRE was not given 
sufficient resources or all the money that was 
going to be needed for testing,” Macias says. 

“You can buy thousands of tests, but if 
you don’t have the personnel [to perform 
them], you can’t do much,” adds Celia 
Mercedes Alpuche Aranda, an infectious dis- 
ease researcher at the National Institute of 
Public Health and former director of InDRE. 
López-Gatell Ramírez’s strategy, she says, is 
adapted to the country’s reality: too many 
people to test and not enough infrastructure. 

Testing isn’t the only area where López- 
Gatell Ramírez has drawn criticism. As Eu- 
rope and the United States were dealing with 
their first waves of infections, he advised 
against closing Mexico’s borders. Doing so 
would harm the economy, he said, but would 
not prevent the pandemic from arriving—a 
view shared by the World Health Organization
and others at the time. Many epidemiologists
now concede that closing borders can
help stem viral spread. 

His stance on face masks is perhaps his 
most controversial: Despite growing pres- 
sure from the public, politicians, and scien- 
tists citing evidence that masks can reduce 
viral transmission, López-Gatell Ramírez still 
hasn’t mandated their use nationwide. Masks 
would give a “false sense of security” and lead 
people to relax other measures such as hand 
washing and keeping a healthy distance, he 
said early in the pandemic. Even as evidence 
of the benefits of wearing masks became 
clear, he still has not fully come around, crit- 
ics say. Some Mexican states, however, have 
imposed their own mask mandates. 

Macias suspects López-Gatell 
Ramírez’s statements on masks have
been influenced by the views of his 
boss, López Obrador, who refuses to 
wear a face mask in public and con- 
stantly downplays the pandemic. 
“He had to defend the indefensible,”
Macias says. 

Despite the adversities, the Mexi- 
can health system has scored some 
victories under López-Gatell Ramírez.
Starting with a deficit of more than 
310,000 health care workers, he says, 
the system hired more than 47,000 
people in a matter of months and 
tripled the number of intensive care 
beds with ventilators. 

“I think he’s the right person to be in
that position,” says infectious disease 
physician Gustavo Reyes Terán, who
leads the country’s National Institutes 
of Health and its affiliated hospital 
network. The institutes and hospitals 
Terán coordinates avoided becoming
overwhelmed, he says, thanks to
López-Gatell Ramírez. “That, to me,
has been one of the most important 
successes here,” he says. (Media have 
reported, however, that many Mexi- 
cans died at home without setting foot 
in a hospital.) 

López-Gatell Ramírez has also taken
action against one cause of the virus’ toll in
Mexico: high rates of hypertension, obesity,
and diabetes, which can make severe disease
more likely. In response, López-Gatell
Ramírez has renewed his long-standing campaign
against junk food and sugary drinks,
calling them “bottled poison.” 

“He is a person with good intentions to do 
things well, to be based on science,” Alpuche
Aranda says. “He’s doing the best he can,” 
Macias adds. 

How history judges López-Gatell Ramírez,
however, may ultimately depend on the pan- 
demic’s toll on his nation. 





Rodrigo Pérez Ortega is a journalist in Mexico City. 
Beijing-based Sinovac, which makes a COVID-19 vaccine from inactivated virus, has efficacy trials underway in Brazil, Turkey, and Indonesia—but not in China.





CHINA'S VACCINE GAMBIT 


With its global campaign to test and promote 
COVID-19 vaccines, China aims to win friends and cut deals 


By Jon Cohen

The first people in the world to receive
a COVID-19 vaccine were not
part of a clinical trial. No TV stations
or newspapers covered the
historic event. No company issued
a statement.

On 29 February, less than
2 months after the world awakened
to the threat of the new disease,
virologist Chen Wei, a major general in
China’s army, and six military scientists on
her team stood in front of a Chinese Communist
Party flag and received injections of
an experimental COVID-19 vaccine. Chen,
a national hero for her work on Ebola vaccines,
had come to the initial center of the
pandemic, Wuhan, with her group from the 
Academy of Military Medical Sciences, in
part to help make the candidate vaccine with 
pharmaceutical company CanSino Biologics. 
Commentators inside and outside of China 
later questioned whether the event, which 
received wide play on social media, was real. 


Science's COVID-19 reporting is supported by the 
Pulitzer Center and the Heising-Simons Foundation. 
No less than People’s Daily, the Communist
Party’s main newspaper, labeled a photo 
of Chen receiving the vaccine as “#FAKE- 
NEWS.” But Hou Li-Hua, a researcher at the 
academy who works on the vaccine project, 
says it was “true news”—an attempt to protect
the scientists in the hard-hit city.

In the United States, the Trump adminis- 
tration’s $10.8 billion Operation Warp Speed 
accelerated vaccine R&D faster than many 
researchers thought possible, specifically for 
the U.S. population. But an equally massive 
effort has unfolded in China. CanSino and 
two other Chinese companies—one owned 
by the government, the other working closely
with its regulatory agency—are investing sub- 
stantial resources, testing four candidates in 
tens of thousands of volunteers around the 
world. They are likely only days or weeks 
away from announcing the outcomes of ef- 
ficacy trials, just behind the encouraging 
early results recently announced by a brace 
of companies and institutions outside China. 

But the low profile of those historic first
injections, the military collaboration with a
“private” company, and the ethically fraught
decision to start with vaccinations outside of
a clinical trial telegraphed that aside from
the similar scale and speed, China’s vaccine
effort is following a very different course
from those in the United States and Europe.
Most leading Western vaccines rely on sexy
technologies such as genetically engineered
viral vectors, designer proteins, and snippets
of RNA. Three of China’s leading vaccine candidates
use an unfashionable stalwart: the
whole inactivated virus, an approach that
dates back to the first successful flu vaccine
in the 1930s.

And China’s vaccine effort is cursed by
the country’s dramatic success with aggressive
public health measures to stop the
spread of the virus, SARS-CoV-2, including
forced isolation of cases and testing of entire
cities. Whereas the raging pandemic in
the United States has enabled trials there
to quickly deliver signals of efficacy, “China
crushed the coronavirus epidemic early, so
they lost the opportunity to test the efficacy
of their vaccines there,” says epidemiologist
Ray Yip, who closely follows COVID-19
vaccine development as an adviser to Bill
Gates. “If they had plenty of cases in China,
they could have finished an efficacy trial
ahead of other people.”

So China’s vaccine developers have gone
abroad. Although the United States has shut
them out of Operation Warp Speed, they have
brokered deals with 14 other countries on five
continents. They have mounted massive trials
in the Arab world—and given candidate
vaccines to top government officials there—
and navigated toxic politics in Brazil, where
the pandemic is raging fiercely, to test a vaccine
and explore producing it there.

But China isn’t just seeking promising venues
for clinical trials. Not urgently needing
the vaccines at home to fight a virus it has
largely quashed, it is playing a global game by
pledging to send any proven vaccine to countries
that are conducting trials for its candidates,
or to share the technologies behind
them. “They know they don’t need a vaccine
to contain the epidemic in China,” Yip says.
“They can take their sweet time.”

Vaccine road trip

With few COVID-19 cases at home, Chinese vaccinemakers have had to test the worth of their candidates
abroad. Four are in efficacy trials in 14 countries.
[Map and table of trial sites, start dates, and sizes; only fragments survive, naming Russia, Pakistan, Mexico, Argentina, and Chile.]

*Two branches of China National Biotec Group (CNBG) are developing similar, but distinct, vaccines.

Yanzhong Huang, a global health special- 
ist at both Seton Hall University and the 
Council on Foreign Relations, says the coun- 
try is “actually using the vaccine to promote 
the diplomacy of foreign policy objectives.” 
This “vaccine diplomacy,” he says, contrasts
starkly with Warp Speed’s “vaccine national- 
ism” and aims to “fill in the void left by the 
United States.” 

“It is a very carefully executed and carefully
thought-out strategy,” says Stephen Morrison,
who directs the Global Health Policy Center 
at the Center for Strategic & International 
Studies. “A strategic goal of the Chinese government
is to achieve hegemonic influence in
the bioeconomy within the next decade.” 

At home, too, attitudes toward vaccines 
contrast with those in the United States and 
Europe, where mistrust is high, Morrison 
says. To the consternation of vaccine experts 
overseas, hundreds of thousands of people in 
China have already lined up to receive the ex- 
perimental vaccines—even before their value 
and safety have been proved. “There has not 
been a collapse of faith and trust in science 
and in the state,” Morrison says. “There’s less
fear about where this is all going.” 


THE SPEED AT WHICH Chen and her col- 
leagues were able to get those first shots is 
all the more remarkable given that CanSino 
was arguably slow off the mark. 

Although some COVID-19 vaccinemakers 
launched their projects the day after the se- 
quence for SARS-CoV-2 became public on 
10 January, CanSino CEO Yu Xuefeng had 
reservations. “We started to look into it 
in the middle of January, but there was a 
hesitation,” he says. COVID-19, Yu worried, 
might be a blip, like severe acute respira- 
tory syndrome (SARS), another coronavirus- 
caused disease, which alarmed the world in 
2003. Companies and governments poured 
resources into developing vaccines, but the 
disease disappeared a year later. 

Originally from China, Yu completed his 
Ph.D. in microbiology at McGill University in 
Canada in 1997, and then stayed, working on 
vaccines for nearly 9 years at a Sanofi Pas- 
teur branch there. He co-founded CanSino— 
a portmanteau of Canada and China—in 
2009. A team led by Major General Chen 
back in China helped develop its only pre- 
vious product to receive approval: an Ebola 
vaccine based on a widespread and largely 
harmless virus known as adenovirus 5 (Ad5), 
into which they stitched a gene for the sur- 
face protein of the Ebola virus. 

Yu and his team considered making a 
COVID-19 vaccine with messenger RNA 
(mRNA) for the new coronavirus’ surface 
protein, called spike—the innovative approach
taken by Pfizer and its partner BioNTech,
the “winner” of the race to report
preliminary efficacy data. But CanSino de- 
cided to go with what it knew, using the Ad5 
vector to carry the spike gene. “I thought 
that’s the quickest and the mature way of de- 
veloping a new vaccine,” Yu says. 

In just 1 month, CanSino’s candidate was 
ready to be given to Chen and her team, 
and on 16 March the company launched the 
world’s first COVID-19 vaccine trial, in Wu- 
han, to test its safety and ability to provoke 
immune responses. CanSino had beaten 
Moderna, which has an mRNA candidate, 
by 8 hours—though a world transfixed by
the vaccine race among Western companies
paid little attention.

The Chinese company CanSino Biologics had the first COVID-19 vaccine to enter clinical trials, and by April had already progressed to a phase II study in Wuhan.

Several U.S. and European contenders, 
including AstraZeneca, have also adopted 
adenoviruses to carry the spike protein, some 
opting for an Ad5 vector similar to CanSino’s, 
despite several concerns about the approach. 
In 2007, two disastrous efficacy trials of an 
Ad5-based AIDS vaccine found that—for 
still-debated reasons—it actually raised the 
risk of HIV infection. The other worry is that 
preexisting immunity to Ad5 can attack the 
vector, which could explain why, in early tri- 
als, the CanSino vaccine elicited a weaker- 
than-expected antibody response. “We do 
see there’s some impact,” Yu concedes, “but 
it’s not black and white.” (The AstraZeneca
vaccine’s preliminary efficacy data suggest 
immunity against its adenovirus vector may 
have compromised that candidate’s perfor- 
mance, as well.) 

The two other Chinese 
players, Sinovac Biotech 
and China National Biotec 
Group (CNBG)—a subsidiary 
of one of the world’s largest 
vaccinemakers, the state- 
owned Sinopharm—are taking 
a different approach: vacci- 
nating people with the whole, 
“killed” virus. This requires no 
sophisticated protein or RNA 
design or genetic engineering: 
Scientists simply inactivate 
the virus with a chemical (beta-propiolactone)
and mix it with
an adjuvant (alum) that effectively
puts the immune system
on full alert by irritating it. In
theory, such vaccines can produce
broader antibody and T cell responses,
because they contain the full set of viral pro- 
teins, rather than a single one such as spike. 
And unlike mRNA vaccines, which have to 
be stored at subzero temperatures, inacti- 
vated viruses require no more than ordinary 
refrigeration. 

But many scientists view inactivated vi- 
rus vaccines as outmoded, difficult to make 
in high volume, and potentially dangerous. 
Warp Speed outright rejected the approach. 
“I really don’t think the inactivated vaccine 
is a good idea,” says Moncef Slaoui, scientific 
head of Warp Speed. 

A major worry is that inactivated SARS- 
CoV-2 vaccines might trigger more severe 
illness, known as “enhanced respiratory
disease,” in immunized people who do get 
infected. Basically, if a vaccine triggers ineffective
antibodies, they can form immune
complexes that clog the lungs. This occurred
with a vaccine against respiratory syncytial
virus given to children in the 1960s, and in
animal experiments with vaccines against
SARS and another coronavirus disease, Middle
East respiratory syndrome. The prospect
of growing large batches of virus before killing
it also poses challenges; twice in the
past 5 years, live poliovirus has escaped
from European plants making inactivated
virus vaccines for that disease.

Mohammed bin Rashid Al Maktoum (left), prime minister of the United Arab Emirates,
on 3 November received a COVID-19 vaccine from CNBG.

But inactivated virus vaccines, unlike 
mRNA and other technologies handsomely 
supported by Warp Speed, have a solid track 
record. “There are lots of different ways that 
vaccines are made, and it’s great that inno- 
vation is occurring alongside tried-and-true 
approaches,” says Nicole Lurie, a strategic ad- 
viser to the Coalition for Epidemic Prepared- 
ness Innovations (CEPI) who formerly served 
as U.S. assistant secretary for 
preparedness and response. 
“Inactivated vaccines are one 
of several tried and true approaches.”
Meng Weining, a
senior director at Sinovac, says 
the company compared the 
inactivated approach—which 
it already uses to make six 
vaccines—with two other strat- 
egies in animal models. “The 
inactivated whole virus vaccine 
gave a much, much better re- 
sult,” Meng says. 

Although it is easier, in the- 
ory, to produce mRNA in vast 
quantities than it is to grow 
the virus on a similar scale, 
vaccine experts say producing
the inactivated virus vaccines
is unlikely to be a hurdle. CNBG, for
example, has “enormous resources: 10,000
employees and scientists, huge manufacturing
capability,” says Nicholas Jackson, who
heads CEPI’s China office and previously 
worked on vaccine R&D at Pfizer. “They are 
a very competent beast.” And, crucially for 
China’s vaccine diplomacy, many other coun- 
tries have manufacturers that have produced 
inactivated virus vaccines for decades. 

If China’s COVID-19 vaccines work, man- 
ufacturers say they could turn out 1.5 bil- 
lion doses in total next year. And countries 
that cannot access vaccines bankrolled 
by Warp Speed—especially countries that 
hosted China’s efficacy trials—might have 
a more secure vaccine supply. 


THE PRIME MINISTER of the United Arab Emir- 
ates (UAE), Sheikh Mohammed bin Rashid 
Al Maktoum, on 3 November tweeted a 
photo of himself in Dubai, the right sleeve 
of his kandura rolled high, being injected 
with a CNBG COVID-19 vaccine. “We wish 
everyone safety and great health, and we are 
proud of our teams who have worked relent- 
lessly to make the vaccine available in the 
UAE,” Al Maktoum wrote. Two of the coun- 
try’s top ministers had received the vaccine 
3 weeks earlier. 

UAE has become the cornerstone of 
CNBG’s efficacy trials and is following Chi- 
na’s controversial lead in allowing people to 
receive the vaccine outside of clinical trials. 

In a video conference on 23 June that 
linked UAE and China, health officials, am- 
bassadors, and CNBG executives sat at long 
tables in rooms decorated with each coun- 
try’s flags and celebrated their decision to 
stage an efficacy trial together. The trial has
since expanded to Bahrain, Egypt, and Jordan
and hopes to recruit 45,000 people.

In Brazil, Sinovac’s vaccine is caught in a battle between President Jair Bolsonaro and São Paulo’s governor, João Doria. The mask says, “No vaccine! Oust Doria.”

CNBG says it came to UAE to test its two 
whole virus vaccines—similar inactivated 
preparations made by two independent, and 
even competitive, laboratories—because the 
high SARS-CoV-2 infection rate there should 
speed an efficacy signal. But diplomacy and 
commerce also drove the decision. UAE’s 
enormous foreign workforce means trial 
participants come from 125 different coun- 
tries. “If you can prove these vaccines work 
in UAE,” Huang says, “that means everybody 
in the world would think that the vaccine 
would work in their countries, too.” 

China may be hoping for a public rela- 
tions (PR) benefit as well: UAE and many 
of the other collaborating countries have 
large Muslim populations, which Huang 
says could help mitigate human rights com- 
plaints about China’s treatment of Uyghur 
Muslims in Xinjiang province. “They cer- 
tainly don’t want to have more enemies over- 
seas,” he says. 

Huang adds that through its array of 
overseas trials, China hopes to build good- 
will for its Belt and Road Initiative (BRI), 
a massive investment in infrastructure 
in more than 100 countries to increase 
trade. Critics have charged that the BRI is 
“debt-trap” diplomacy that’s a form of neo- 
colonialism. “China wants to work with 
these countries and prioritize them to have 
the vaccine because I think they believe 
this is going to facilitate the implementa- 
tion of the BRI,” he says. 

China’s vaccine diplomacy has not always 
gone smoothly. On 9 November, after Brazil 
suspended a trial of Sinovac’s vaccine fol- 
lowing the death of a participant, President 
Jair Bolsonaro took to Facebook. “Morte, invalidez,
anomalia,” he wrote, quoting from
a Brazilian health agency that had listed 
possible reasons for the suspension: death, 
disability, genetic anomalies. Bolsonaro’s 
message was clear: This Chinese vaccine, 
CoronaVac, was dangerous. 

“Many people were really taken aback because
of that post,” says Esper Kallas, who
heads the vaccine trial site at the University
of São Paulo that the participant had joined.
“He was celebrating the failure of a vaccine.”
For Bolsonaro, it was a PR victory over
his arch political rival, the governor of São
Paulo, who backed the CoronaVac trial. The
president was also delighting in an apparent 
setback for China, which Bolsonaro, like his 
ally, U.S. President Donald Trump, has criti- 
cized relentlessly. 

It turned out the participant died from 
a drug overdose. His death had nothing to 
do with CoronaVac, and the efficacy trial 
quickly resumed. 

China chose to navigate Brazil’s daunt- 
ing politics because with an out-of-control 
pandemic—it is third in the world in total 
infections, with more than 100,000 new 
cases every week—the country is a mag- 
net for vaccine testing and is desperate 
for vaccines. São Paulo state in September
committed $90 million to Sinovac for
46 million doses. (This, notably, is about one-tenth
of the price the U.S. government is
paying for the Pfizer-BioNTech and Moderna
mRNA vaccines, which appear to be
powerfully effective.) And Brazil could augment
the supply by making vaccine itself:
Sinovac says it may transfer its technology
to the Butantan Institute, a major vaccine
manufacturer in São Paulo, a collaboration
that Meng describes as a “win-win.”

China has had warmer receptions in other 
countries. Turkey in September launched a 
13,000-person efficacy trial of Sinovac’s vac- 
cine. Serhat Unal, who heads the Hacettepe 
University Vaccine Institute—which is simi- 
lar to Butantan—and is on the scientific 
board of the Ministry of Health, says Tur- 
key has “a good infrastructure for the phase 
III studies” and, unlike the United States 
and much of Europe, welcomed a Chinese 
vaccinemaker. 

The three Chinese manufacturers also 
have large efficacy trials planned or under- 
way in Indonesia, Pakistan, Saudi Arabia, 
Mexico, Argentina, and Chile (see map, 
p. 1264). It’s a good strategy, Unal says. 
“When you do the phase III in different 
countries, it’s more transparent, it’s more 
trustworthy,” he says. 

As much as vaccine diplomacy and the 
“soft power game” influence where the Chi- 
nese vaccinemakers cut deals for efficacy 
trials, they are also driven by capitalism, 
says Yip, who for 4 years headed the China 
office of the U.S. Centers for Disease Control 
and Prevention. “Everybody’s clamoring for 
some COVID vaccine,” he says. “They all 
want to tell their people we have secured 
some vaccine for you.” And Chinese compa- 
nies aim to profit by supplying it. 


IT’S A SAFE BET that one or more of China’s 
overseas trials will announce efficacy data 
any day. The results so far for other vaccines 
have fed a growing sense that many of the 
candidates will wallop what is, from a vac- 
cine’s point of view, a somewhat wimpy vi- 
rus. But China is not waiting for the phase 
III results before widely using its vaccines 
at home. Its regulators appear to be satis- 
fied with animal studies, combined with 
the minimal safety and immune response 
data from phase I and II trials. In June, 
CanSino received an emergency use autho- 
rization to vaccinate the military, and since 
then both Sinovac and CNBG have received 
green lights to vaccinate large populations 
outside of clinical trials. 

With the pandemic vanquished at 
home, China is vaccinating its people as 
insurance—often, against a dangerously 
infected world. CanSino’s Yu says “thou- 
sands” of troops on peacekeeping missions 
have received his company’s vaccine before 
traveling to places that have a high bur- 
den of COVID-19. CNBG says “hundreds 
of thousands” of people in China have re- 
ceived its vaccines. “By doing this, we are 
able to build an immune barrier among 
specific groups of people like healthcare 
workers, pandemic prevention person- 
nel, and border inspection personnel,” the 
company explained in its written replies to 
Science. Vaccination is “completely voluntary
with informed consent,” CNBG stresses.
What’s more, “We did not receive a single 
case report of severe adverse reaction, and 
no infections reported for vaccinees work- 
ing in high risk areas.” 

Sinovac’s Meng says “more than 90%” 
of the company’s employees have received 
its vaccine because they are considered a 
high-risk group; he received it because he 
travels overseas. (According to China’s Min- 
istry of Culture and Tourism, 155 million 
Chinese people traveled overseas in 2019, 
and 145 million tourists visited the coun- 
try.) In October, the company 
began to sell its vaccine—$60 
for two doses—in Yiwu, a 
city in Zhejiang province. 

Yip says the government was 
even considering vaccinating 
all of Beijing after a COVID-19 
outbreak there in June. He says
officials “already had written 
the guidelines”; if more than 
500 cases had surfaced, “they 
would shoot up everybody in 
Beijing with the vaccine.” In 
the end, contact tracing, test- 
ing, and isolation of infected 
people limited the outbreak to 335 cases. 

Morrison says the Chinese government 
has clearly “decided at the highest levels” 
that it’s worth the gamble to create “facts 
on the ground” and gain a global market- 
ing advantage by having the first COVID-19 
vaccines in wide use. “It’s high risk and it’s 
potentially high gain,” he says. 

But what if harm does occur? “You 
shouldn’t apply the rules of peacetime during 
the war. Our lives are turned upside down,” 
says Yip, who lives part-time in Beijing. 


IF ITS VACCINE GAMBLE succeeds, China’s 
image will gain a boost both at home and 
abroad. “They have reputational problems, 
internal and external,” Morrison says. 

In May, China’s President Xi Jinping told 
the World Health Assembly, which governs 
the World Health Organization (WHO), 
that the country would make its COVID-19 
vaccines “a global public good,” a some- 
what vague declaration that had many 
China watchers scratching their heads. But 
then China followed up on this commitment 
in October by joining the COVID-19 Vaccines 
Global Access (COVAX) Facility, an effort led 
by WHO, CEPI, and Gavi, the Vaccine Alli- 
ance, to make sure that any products proved 
safe and effective quickly reach rich and 
poor countries alike. 

Although joining COVAX arguably gives 
China an insurance policy to obtain vaccines 
if its own candidates fail, Morrison says 
it’s primarily a diplomatic move. COVAX 
hasn’t received support from the United 
States or Russia, and China sees that it 
“could have a controlling influence over a 
major international mechanism.” In ad- 
dition, says Alexandra Phelan, a lawyer at 
Georgetown University’s Center for Global 
Health Science and Security who specializes 
in China, “It is a good act of a global citizen 
to support this effort.” 

If a Chinese-made vaccine proves safe and 
effective, it could help people forget that the 
pandemic started there and how badly the 
government responded at first, Morrison 
says. And at home, it could brighten the im- 
age of China’s vaccinemakers. 
Chinese citizens have reeled 
from a series of scandals over 
the past decade that include 
ineffective diphtheria, per- 
tussis, and tetanus vaccines; 
improper records for a rabies 
vaccine; and sales of an ex- 
pired polio vaccine. 

In something of a twist, 
Yip says China’s middle class 
may prefer a vaccine from a 
reputable foreign company. 
“Their level of confidence 
in Chinese-made vaccines 
is quite low because of all the repeated 
scandals,” he says. AstraZeneca and Pfizer 
have agreements to produce their prod- 
ucts with Chinese manufacturers. “They 
will outsell the CNBG, Sinovac, and Can- 
Sino [vaccines] by 10 to one—and they will 
charge 10 times more,” Yip predicts.

A successful Chinese-made COVID-19 vac- 
cine that has been scrutinized by outside 
regulators would reassure the domestic mar- 
ket, Phelan says. “There’s a lot of domestic 
ground to make up.” 

In Brazil, Kallas says a similar dilemma 
could play out if Butantan, as hoped, starts 
to make Sinovac’s CoronaVac. “There is a 
saying here that the neighbor’s chicken is always
the most tasty,” Kallas says. “We have
this perception that everything we do is not 
as good as the imported thing.” 

But for now, Brazil is embracing the Chi- 
nese vaccine. With cases surging, the arrival 
of a mere 120,000 doses of CoronaVac on 
19 November became a big news story. The 
bias against China is little more than a far- 
right political “contamination,” Kallas says, 
and most Brazilians see CoronaVac as “a vi- 
able option.” 

“I’d take it, no questions asked—this is a
no-brainer,” he adds. “The Pfizer and Moderna
news was taken as a relief, but the
problem is that both these vaccines are not 
in Brazil’s grasp.” 

In Brazil, as in much of the world, China’s 
warp speed vaccines may still, in the most 
meaningful way, come in first. 

Diversity hotspots:

Phylogeny of suboscine passerine birds finds lower speciation rates
in biodiverse regions

By Hélène Morlon


From the tropical Andes to the Mediterranean
Basin, some regions of Earth
host an extraordinary number of species
compared to less diverse regions.
Robust species-level phylogenies
(which map evolutionary relatedness) 
obtained from genetic sequencing are key for 
understanding the ecological and evolution- 
ary processes at the origin of such species 
richness gradients. Yet, they are still incom- 
plete for species-rich groups, even among the 
most studied organisms such as birds. On 
page 1343 of this issue, Harvey et al. (1) assemble
an impressive phylogenomic dataset
for the largest Neotropical bird group, the
suboscine passerine radiation. Their analysis 
of these data suggests that species origina- 
tion is slower, rather than faster, in hotspots 
of suboscine diversity and that time, rather 
than speciation rates, explains suboscine di- 
versity gradients. 

Species-level phylogenies have been in- 
strumental in understanding the origin of 
biodiversity hotspots and richness gradients. 
Coupled with geographic data and models 
of speciation and extinction, comprehen- 
sive phylogenies provide useful information 
about the pace of diversification (the balance 
of speciation and extinction) in space and 
time (1-7). Yet, large-scale phylogenetic trees 
are often constructed with few genetic mark- 
ers, or even by including species without ge- 
netic data. Increasing the robustness of these 
trees is important to obtain robust diversification
rate estimates. As Harvey et al. show,
their analyses on a less robust suboscine tree 
(2) would not have detected the negative as- 
sociation between present-day speciation 
rates and species richness. 

The relationship between existing spe- 
cies richness and speciation rates has often 
been discussed in the context of understand- 
ing why biodiversity peaks in the tropics. 
Different hypotheses for this latitudinal 
gradient in diversity lead to different pre- 
dictions regarding diversification rates (8). 
According to the time hypothesis, the tropics 
are older and therefore have had more time 
to accumulate species, even in the absence 
of any difference between tropical and tem- 
perate diversification rates. By contrast, the
diversification rate hypothesis posits that the
tropics accumulate species faster, either
because they are “a cradle” of biodiversity with
high speciation rates, or because they are “a
museum” of biodiversity with low extinction
rates. Until recently, phylogenetic and
paleontological data generally supported the
diversification rate hypothesis, with evidence
for higher speciation rates in the tropics
where diversity peaks (8).

The giant antpitta (Grallaria gigantea) in Ecuador
is one species of thousands of suboscine passerines.
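
The contrast between the time hypothesis and the diversification rate hypothesis can be made concrete with the simplest possible model, in which expected richness grows exponentially at the net diversification rate (speciation minus extinction). The sketch below is only an illustration under that assumption: the function name, the rates, and the ages are hypothetical and are not taken from Harvey et al.

```python
import math

def expected_richness(speciation, extinction, age_myr, founders=1):
    """Expected species count under a constant-rate birth-death model.

    Illustrative assumption: richness = founders * exp((speciation - extinction) * age).
    Rates are per lineage per million years; age is in million years.
    """
    net_rate = speciation - extinction
    return founders * math.exp(net_rate * age_myr)

# Hypothetical values: an old region with slow speciation
# versus a young region with fast speciation.
old_slow = expected_richness(speciation=0.15, extinction=0.05, age_myr=60)
young_fast = expected_richness(speciation=0.25, extinction=0.05, age_myr=20)

print(f"old, slow region:   {old_slow:.0f} expected species")
print(f"young, fast region: {young_fast:.0f} expected species")
```

With these made-up numbers the older region ends up richer despite its lower speciation rate, which is the pattern the time hypothesis predicts.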

The negative association between spe- 
ciation rates and species richness found 
by Harvey et al. contravenes this evidence, 
echoing recent studies in birds at different 
elevations (5) and latitudinal studies in fish 
(6) and flowering plants (7). Harvey et al. 
suggest that species-rich areas are coldspots 
rather than hotspots of speciation and that 
the diversity of these areas is explained by 
their old age and/or low extinction rates. If 
this interpretation is correct and the results 
shared with other species groups, focus may 
need to change from understanding why spe- 
ciation rates are higher in biodiverse regions 
such as the tropics to understanding why 
they are lower. But should conclusions be
drawn so hastily?

Recent studies have focused on speciation 
rates “at the tips” of phylogenies, i.e., present-day
speciation rates (9). Earlier studies based
on sister species comparisons and therefore 
also focused on the recent past had similarly 
found lower speciation rates where biodiver- 
sity peaks (10). By contrast, studies integrat- 
ing information from the distant past have 
often found higher speciation rates in diver- 
sity hotspots (3, 4, 8). Therefore, one explana- 
tion for the apparent paradigm shift is that 
current hotspots of biodiversity were once 
hotspots of speciation, but speciation rates 
decreased drastically in these areas to reach 
present-day levels below those of species- 
poor areas. Such a decline in speciation rate 
is expected if species richness has an upper 
limit set by environmental conditions such as 
resource availability (11). Under this “ecological
limits” hypothesis, regions that accumulated
species rapidly in the past are closer to
their biodiversity capacity than species-poor 
regions, and are therefore less prone to pres- 
ent-day speciation. 
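
The logic of the ecological limits hypothesis can be sketched with a diversity-dependent speciation rate. The linear decline used here is a common textbook form and is an assumption of this sketch, not the model fitted by Harvey et al.; the capacity and rates are hypothetical, and extinction is ignored for simplicity.

```python
def diversity_dependent_speciation(initial_rate, richness, capacity):
    """Speciation rate that declines linearly as richness approaches capacity.

    Assumed form: rate = initial_rate * (1 - richness / capacity), floored at zero.
    """
    return max(0.0, initial_rate * (1.0 - richness / capacity))

def simulate_richness(initial_rate, capacity, steps, dt=0.1, start=1.0):
    """Deterministic (expected-value) richness trajectory using Euler steps."""
    richness = start
    for _ in range(steps):
        rate = diversity_dependent_speciation(initial_rate, richness, capacity)
        richness += rate * richness * dt
    return richness

# Hypothetical regions with the same capacity but different initial speciation rates.
capacity = 500.0
fast_start = simulate_richness(initial_rate=0.5, capacity=capacity, steps=300)
slow_start = simulate_richness(initial_rate=0.1, capacity=capacity, steps=300)

fast_now = diversity_dependent_speciation(0.5, fast_start, capacity)
slow_now = diversity_dependent_speciation(0.1, slow_start, capacity)
print(f"fast-start region: richness {fast_start:.0f}, present-day rate {fast_now:.4f}")
print(f"slow-start region: richness {slow_start:.0f}, present-day rate {slow_now:.4f}")
```

In this toy setting the region that speciated rapidly in the past is now the richer one, yet it shows the lower present-day speciation rate because it sits close to its capacity.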

Harvey et al. found mixed support for this 
hypothesis. The best model to fit their phy- 
logenetic tree is one that contains a causal 
link between climatic variables and species 
richness, and an inverse correlation be- 
tween species richness and speciation rates. 
This suggests that the number of species in 
a given region modulates speciation rates. 
Conversely, Harvey et al. did not find evi- 
dence for the speciation slowdown expected 
under the ecological limits hypothesis. 
Given that the existence of ecological limits
is highly debated (12), additional evidence 
is needed, or other plausible explanations 
should be investigated. 

The analysis by Harvey et al. of recent spe- 
ciation rates, along with that of others (5-7), 
forces reconsideration of traditional evolu- 
tionary hypotheses for explaining richness 
gradients. However, recent speciation rates 
are only one piece of the puzzle. A thorough 
understanding of species richness gradients 
requires mapping speciation and extinction 
rates to geography not only in the present, 
but also in the past, while allowing these 
rates to vary through time. This is a challenging
task, but a tractable one.
For example, geographically dependent di- 
versification models have allowed estima- 
tion of temporal variations in tropical and 
temperate speciation rates (3). An alternative 
approach for mapping past rates to geogra- 
phy is to couple recent approaches that allow 
estimation of lineage-specific diversification 
rates (13) to ancestral biogeographic recon- 
structions. Given that speciation rates esti- 
mated from only extant data have diminished 
reliability for historical rates, and that extinc- 
tion rates, which may play an important role 
in modulating richness gradients, are notori- 
ously difficult to estimate from reconstructed 
phylogenies (14), further efforts to incorporate
direct information from the past (i.e.,
fossils) will be particularly useful (15).

The assembly and analysis of comprehen- 
sive phylogenomic data for the largest tropi- 
cal bird radiation by Harvey et al. contribute 
to mounting evidence that new species may 
not always be preferentially generated in 
biodiversity hotspots. These findings sug- 
gest a new burning question: Have diversity 
hotspots always been coldspots of speciation, 
or did they turn from hotspots to coldspots of 
speciation as diversity accumulated? 

STRUCTURAL BIOLOGY


Using genetics to reveal protein structure


Measurements of genetic interactions in vivo can be used to determine protein structure


By Dong Wang


Structure explains function. Knowing
the three-dimensional (3D) shape and 
the arrangement of atoms in a biomol- 
ecule can help us understand how it 
works. For example, the DNA duplex 
structure provides insights into how 
genetic information is copied (1). The current
tools for structural determination are
primarily biophysical methods, such as x-ray 
crystallography, nuclear magnetic resonance 
(NMR) spectroscopy, and cryo-electron mi- 
croscopy. Now, structural biologists are about 
to embrace a new tool for structural deter- 
mination in cells. On page 1294 of this issue, 
Braberg et al. (2) describe a genetic approach 
for structural determination using pheno- 
typic readouts from genetic perturbations. 
This approach opens new doors for the struc- 
ture determination of many biological com- 
plexes in their native environment in cells. 

It has long been recognized that the inher- 
ent information that controls the folding of 
peptide chains into the specific 3D structures 
of proteins is stored in their linear amino 
acid sequence (3). However, how to decipher 
and use this for 3D structure determination 
remains a challenge. A breakthrough came 
when scientists compared the primary se- 
quences of evolutionarily conserved protein 
homologs and found that the pairs of resi- 
dues that are close in 3D space tend to have 
strong correlations in terms of sequence vari- 
ations during evolution (covariation) (4). The 
information in these covariation patterns can 
be used for predicting protein folding with 
impressive success rates (4). Inspired by this 
success, scientists sought to test whether they 
could infer 3D structural information by analyzing
the experimental covariation pattern
of two mutations in the same protein.

A genetic approach to determine protein structure
The genetic approach reported by Braberg et al. involves generation of a point mutant library from which in vivo phenotype profiles [such as point mutant epistatic
miniarray profiles (pE-MAPs) or chemical genetics miniarray profiles (CG-MAPs)] are constructed. Phenotypic similarities are then quantified by assigning maximal
information coefficient (MIC) values, and three-dimensional models are constructed using spatial restraints from pairwise genetic perturbations, structural refinement,
and validation. Integrative structure determination can also include inputs from other methods.
Figure labels: Point mutations; In vivo mutants; Phenotypic readout

Recently, several groups independently 
used deep mutational scanning (DMS), a 
technique that can generate and analyze 
functional consequences of a large set of 
point mutations in a protein, for structure 
determination (5-7). A dataset compris- 
ing the fitness effects of nearly all possible 
double mutants of a 56-amino-acid protein 
domain called IgG-binding domain of pro- 
tein G (GB1) was analyzed and used to deter- 
mine the structure of GB1 domains (5, 6). The 
central idea of this genetic approach is that 
strong effects on protein function may occur 
when two genetic mutations affect amino ac- 
ids that are proximal in 3D space. Thus, ge- 
netic interaction analysis can infer distance 
restraints that facilitate computing and refin- 
ing protein structure models. In addition to 
GB1, structures of several other protein domains
were also computed using the same
approach (5, 6). DMS can also be used to 
determine biologically active conformations 
in cellular environments, even for highly dy- 
namic multiconformational proteins such as 
α-synuclein (7). Moreover, an experimental
method for protein structure determination, 
called 3D-seq, can compute 3D structures 
based on the analysis of sequence covaria- 
tion patterns from experimental evolution 
of bacteria (Escherichia coli) that mimicked
the natural evolution of two distinct anti- 
biotic resistance proteins (8). However, it is 
not clear whether these approaches could be 
transferable to a much more complicated sys- 
tem, such as large protein complexes. 
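
The central idea, that unexpectedly strong double-mutant effects hint at spatial proximity, can be sketched with a simple epistasis score. The block below assumes a multiplicative null model, in which the expected double-mutant fitness is the product of the two single-mutant fitnesses. This is one common convention and not necessarily the scoring used in (5, 6); the residue numbers, fitness values, and threshold are all hypothetical.

```python
def epistasis(fitness_a, fitness_b, fitness_ab):
    """Deviation of double-mutant fitness from a multiplicative expectation.

    Fitness values are relative to wild type (wild type = 1.0).
    """
    return fitness_ab - fitness_a * fitness_b

def candidate_contacts(double_mutants, threshold=0.2):
    """Return residue pairs whose absolute epistasis exceeds the threshold, strongest first."""
    scored = [
        (abs(epistasis(fa, fb, fab)), pair)
        for pair, (fa, fb, fab) in double_mutants.items()
    ]
    return [pair for score, pair in sorted(scored, reverse=True) if score > threshold]

# Hypothetical measurements: (residue_i, residue_j) -> (fitness_i, fitness_j, fitness_ij)
measurements = {
    (5, 30): (0.8, 0.7, 0.10),   # far worse than the 0.56 expected
    (12, 44): (0.9, 0.8, 0.71),  # close to the 0.72 expected
    (20, 26): (0.5, 0.6, 0.85),  # far better than the 0.30 expected
}

print(candidate_contacts(measurements))
```

Pairs returned by such a filter would then be treated as candidate distance restraints rather than as confirmed contacts.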
Braberg et al. describe an integrative 
structure determination approach for yeast 
(Saccharomyces cerevisiae) protein com- 
plexes. They constructed phenotypic pro- 
files for point mutations of a target protein 
crossed against a library of genome-wide 
gene deletions or exposed to a collection of 
different environmental perturbations (see 
the figure). The phenotypic profiles of all mutations
in the analysis were compiled into what is called
a point mutant epistatic miniarray profile
(pE-MAP). They used the maximal information
coefficient (MIC) as a quantitative
measurement of similarities between pairs 
of phenotypic profiles in the pE-MAP. A pair 
of proximal point mutations is more likely 
to have a high MIC value than a pair of dis- 
tal point mutations. Therefore, MIC values 
are informative about an upper distance 
bound between the residues. They then ap- 
plied this distance restraint information for 
structural determination. 
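
How profile similarity might translate into a distance restraint can be sketched as follows. Two assumptions are made here and should be read as such: absolute Pearson correlation stands in for the MIC (which needs a dedicated implementation), and the linear mapping from similarity to an upper distance bound, including the 10 Å and 40 Å limits, is invented for illustration rather than taken from Braberg et al. The mutant names and scores are hypothetical.

```python
import math
from itertools import combinations

def pearson(x, y):
    """Pearson correlation between two equally long phenotypic profiles."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)

def upper_distance_bound(similarity, near=10.0, far=40.0):
    """Map a similarity in [0, 1] to a hypothetical upper distance bound (angstroms).

    Higher similarity gives a tighter (smaller) bound; the linear form is an assumption.
    """
    clipped = min(1.0, max(0.0, similarity))
    return far - (far - near) * clipped

# Toy pE-MAP: three point mutants scored against six hypothetical gene deletions.
pe_map = {
    "mutant_a": [1.0, -2.0, 0.5, 3.0, -1.0, 0.0],
    "mutant_b": [0.9, -1.8, 0.7, 2.7, -1.2, 0.1],    # resembles mutant_a
    "mutant_c": [-0.5, 0.3, 2.0, -0.1, 0.4, -1.5],   # unlike mutant_a
}

bounds = {}
for first, second in combinations(pe_map, 2):
    similarity = abs(pearson(pe_map[first], pe_map[second]))  # stand-in for MIC
    bounds[(first, second)] = upper_distance_bound(similarity)
    print(f"{first} / {second}: similarity {similarity:.2f}, bound {bounds[(first, second)]:.1f} A")
```

Because the restraint is only an upper bound, a dissimilar pair is left nearly unconstrained rather than being forced apart.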

As a proof of principle, Braberg et al. de- 
termined the structure of the histone H3-H4 
complex based on ~500,000 genetic interac- 
tions of 350 yeast mutants. The computed 
ensemble structural model is notably accu- 
rate and precise in comparison with the x-ray 
structure. They further estimated that 35 to 
40 mutations per component are necessary 
to generate a reasonable model of a protein 
complex. Furthermore, the same pE-MAP 
could also be used to pair specific histone 
residues to their respective modifying en- 
zymes, which are only transiently associated 
with the histones in vivo. 

To demonstrate the generality of this ap- 
proach, Braberg et al. also determined the 
structure of two large subunits of RNA poly- 
merase II (Pol II) using a pE-MAP comprising 
~60,000 genetic interactions (9). They found 
that the accuracy of the pE-MAP-derived 
structural model is comparable to those based 
on chemical cross-links or coevolution analy- 
sis. In addition, they also obtained a reason- 
able structural model of two large subunits 
of bacterial RNA polymerase (RNAP) based 
on a chemical genetics miniarray profile (CG- 
MAP) of 44 RNAP point mutations exposed 
to 83 different environmental stresses. Thus, 
this approach can be transferable to other 
types of phenotypic profiles. Integrative 
structure determination uses different types 
of data to improve the accuracy and precision 
of the model (10). Indeed, the authors showed 
that model accuracy and precision can be fur- 
ther improved if they use restraints from both 
genetic interactions and cross-links. 
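
The step from restraints to a structural model can be illustrated with a toy Metropolis Monte Carlo search. This is a minimal sketch under stated assumptions, not the authors' integrative modeling pipeline: each residue is a single point, only upper-bound distance restraints are scored, and the restraint values, step size, and temperature are hypothetical.

```python
import math
import random

def restraint_score(coords, restraints):
    """Sum of squared violations of upper-bound distance restraints."""
    total = 0.0
    for (i, j), bound in restraints.items():
        violation = math.dist(coords[i], coords[j]) - bound
        if violation > 0:
            total += violation ** 2
    return total

def monte_carlo_refine(n_points, restraints, steps=5000, step_size=1.0,
                       temperature=1.0, seed=0):
    """Metropolis sampling of 3D coordinates from a random initial configuration.

    Returns (initial_score, best_score, best_coords).
    """
    rng = random.Random(seed)
    coords = [[rng.uniform(-50, 50) for _ in range(3)] for _ in range(n_points)]
    score = restraint_score(coords, restraints)
    initial_score = score
    best_score, best_coords = score, [p[:] for p in coords]
    for _ in range(steps):
        idx = rng.randrange(n_points)
        old = coords[idx][:]
        # Propose a random displacement of one point.
        coords[idx] = [c + rng.gauss(0.0, step_size) for c in old]
        new_score = restraint_score(coords, restraints)
        # Always accept improvements; accept worse moves with Boltzmann probability.
        accept = new_score <= score or rng.random() < math.exp((score - new_score) / temperature)
        if accept:
            score = new_score
            if score < best_score:
                best_score, best_coords = score, [p[:] for p in coords]
        else:
            coords[idx] = old
    return initial_score, best_score, best_coords

# Hypothetical upper bounds (angstroms) between four residues.
restraints = {(0, 1): 10.0, (1, 2): 10.0, (0, 2): 15.0, (2, 3): 12.0}
initial, best, model = monte_carlo_refine(n_points=4, restraints=restraints)
print(f"restraint score: {initial:.1f} -> {best:.1f}")
```

Adding restraints from a second data source, such as cross-links, would amount to adding further terms to the score, which is the sense in which integrative modeling combines data types.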

Figure labels: Random initial configurations; Apply restraints for structure determination; Monte Carlo sampling; Optimized models

The genetic approach has several potential 
advantages over biophysical approaches. The 
structural data obtained from genetic muta- 
tions reflects the native state of proteins in 
vivo and does not require purification of 
samples. Therefore, it may help to solve the 
structures of complexes that are difficult 
to isolate and purify or those that are only 
transiently stable. However, there are some 
remaining challenges. The relationship be- 
tween phenotypic pE-MAP measurements 
and structure can be very complicated. For 
example, some mutations in distant posi- 
tions that are part of an allosteric network 
could give rise to similar profiles. Some mu- 
tations may perturb gene expression rather 
than protein function or cause major struc- 
tural changes. Better analysis methods are 
needed to filter out these “abnormal mu- 
tants” that could compromise the accuracy 
of structure determination. 

With recent advances in genome editing,
similar genetic approaches could be used 
for structure determination in human cells. 
Combinations of genetic approaches with 
other in situ approaches, such as superreso- 
lution imaging or cryo-electron tomography 
(11, 12), and with deep-learning artificial 
intelligence approaches (13), could further
improve the resolution of structures of im- 
portant biological complexes in their native 
environment in cells. 

BIOPHYSICS


The glassiness of hardening protein droplets 


Protein condensates can age to form glasses that increase in viscosity but retain elasticity 


By Huaiying Zhang 


In addition to dissolving in the watery
cytosol, proteins can self-assemble into
materials with different mechanical
properties, such as solid filaments or soft
gels, to facilitate various cellular func-
tions. In the past decade, proteins have
been shown to undergo liquid-liquid phase
separation (LLPS) to form liquid droplets
in which proteins are highly concentrated
but are still dynamic and fluid (1, 2). Many
cellular compartments, such as nucleoli and
stress granules, are phase-separated drop-
lets. Although some droplets remain
liquid for function, others harden into
a less dynamic state over time (3, 4). On
page 1317 of this issue, Jawerth et al.
(5) report that the hardening protein
droplets are Maxwell glasses. These
are Maxwell fluids that age like glasses
in that viscosity increases with age,
whereas the elasticity changes little
over time (see the figure).

Phases that proteins form
Proteins dissolved in solution (middle square) can self-assemble
into liquids, solids, and gels. Jawerth et al. report that proteins
can also harden into a new phase, a Maxwell glass. Protein liquids
and Maxwell glasses are easier to reverse (double arrows) than
protein gels and solids (single arrows).

There are many distinctive features
of LLPS that can be functionally rel-
evant, one of which is the liquid prop-
erty of the resulting droplets. Droplet
fluidity is vital for ensuring proper
chemical reactions occurring within
the compartment. Disrupting the liq-
uid properties of the nucleolus, for
example, alters ribosomal RNA bio-
genesis (6). In addition, droplet fusion
can be used for force generation to
organize cellular space, such as clus-
tering genomic loci (7, 8) or bundling
cytoskeleton filaments (9).

However, condensed phases formed
with LLPS are not simple liquids but
have diverse and changing material
properties and are collectively called
biomolecular condensates (1). The de-
pendence of droplet fluidity on environmen-
tal factors (such as salt concentration) and
condensate composition (such as RNA-to-
protein ratio) has been revealed in vitro, with
active rheology measured with optical twee-
zers and passive rheology measured with
microbeads (10-12). Reconstituted droplets
also undergo aging and maturation; that is,
their material properties change with time.
Some condensates harden into a less
dynamic state in which they do not fuse
but stick together after collision, and the
proteins rearrange less within them (3, 4).
Hardened condensates might be functional
in that they can inhibit some chemical re-
actions or provide structural rigidity. The
hardening process can also be vital for cre-
ating different material properties needed
in a multistep cellular process. For example,
in centrosome condensate formation with
the protein spindle-defective protein 5
(SPD-5), the initial dynamic liquid may al-
low rapid protein incorporation early in the
cell cycle, whereas the hardened condensate
may provide the centrosome with the rigid-
ity required to resist microtubule-pulling
forces during mitosis (4).

In addition to hardening, some aged
droplets even nucleate protein aggregates,
including amyloids that are linked to vari-
ous diseases (10, 13, 14). Thus, organizing
biochemistry through LLPS may come at
the cost of promoting pathological pro-
tein aggregation. However, the ability to
nucleate solids is also exploited to seed cy-
toskeleton filaments, including actin and
microtubules, from liquid condensates
that concentrate monomers (9). Not only
do condensate material properties change
with environment, composition, and time,
different material properties also coexist in
subcompartments of a single condensate
such as the nucleolus (15). These examples
highlight the numerous ways that various
condensate material properties can be com-
bined to achieve complex functionality.

Jawerth et al. characterized the material
properties of reconstituted protein conden-
sates that harden over time by combining
optical tweezer manipulation and microrhe-
ology. These hardening condensates be-
have like Maxwell fluids that have both
viscous and elastic components present
at all times. The elasticity of the Maxwell
fluid changes little with age, indicating
that hardening is not a gelation pro-
cess in which molecules become cross-
linked. However, the viscosity strongly
increases with age, suggesting that the
molecular dynamics are hindered by
protein jamming within the liquid.
Because this material exhibits the
behavior of Maxwell fluid but ages
like glass, the authors call it Maxwell
glass. Agreeing with the rheological re-
sults, no substantial structural changes
within hardening condensates were
observed with cryo-electron micros-
copy. Condensate-size shrinkage and
increased protein density within the
condensate were observed with fluo-
rescent microscopy. The origin of pro-
tein jamming and how it links to pro-
tein chemistry await to be determined
but will be needed to understand why
some condensates harden over time
and others do not.

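
The rheological signature described above can be sketched with the standard single-mode Maxwell model, in which the relaxation time is the ratio of viscosity to elastic modulus. The formulas for the storage and loss moduli are the textbook Maxwell expressions; the modulus, viscosities, and ages below are hypothetical values chosen only to show the trend and are not measurements from Jawerth et al.

```python
import math

def maxwell_moduli(omega, elastic_modulus, viscosity):
    """Storage (G') and loss (G'') moduli of a single-mode Maxwell fluid.

    tau = viscosity / elastic_modulus
    G'  = G * (omega*tau)^2 / (1 + (omega*tau)^2)
    G'' = G * (omega*tau)   / (1 + (omega*tau)^2)
    """
    tau = viscosity / elastic_modulus
    x = omega * tau
    storage = elastic_modulus * x ** 2 / (1.0 + x ** 2)
    loss = elastic_modulus * x / (1.0 + x ** 2)
    return storage, loss

# Hypothetical aging series: elasticity held fixed, viscosity rising with age.
elastic_modulus = 10.0  # Pa, assumed constant over time
for age_h, viscosity in [(1, 1.0), (6, 10.0), (24, 100.0)]:  # Pa*s, invented values
    tau = viscosity / elastic_modulus
    storage, loss = maxwell_moduli(omega=1.0, elastic_modulus=elastic_modulus,
                                   viscosity=viscosity)
    print(f"age {age_h:>2} h: tau = {tau:5.1f} s, "
          f"G' = {storage:5.2f} Pa, G'' = {loss:5.2f} Pa at 1 rad/s")
```

As viscosity rises at fixed elasticity, the relaxation time lengthens and the response at a given frequency shifts from liquid-like (loss modulus dominant) to solid-like (storage modulus dominant), even though no cross-linking has been introduced.
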
Jawerth et al. followed the hard- 
ening of five different proteins 
[Caenorhabditis elegans protein PGL-3 (gua- 
nyl-specific ribonuclease pgl-3) and mam- 
malian proteins FUS (RNA-binding protein 
FUS), EWSR1 (RNA-binding protein EWS), 
DAZAP1 (DAZ-associated protein 1), and 
TAF15 (TATA-binding protein-associated
factor 2N)] with fluorescence recovery after 
photobleaching and studied the rheology 
of PGL-3 and FUS with optical tweezer and 
microrheology methods. Future work could 
expand the rheological studies to other con- 
densates. For example, porous meshwork 
in hardened Saccharomyces cerevisiae and 
Schizosaccharomyces pombe Sup35 condensates
was observed with cryo-electron
microscopy, which suggests that they could 
be gels (3). Rheology studies on hardening 
Sup35 condensates would help to deter- 
mine whether and how they age differently 
from condensates reported in this study. 
Nevertheless, it can no longer be assumed 
that all nondynamic condensates are gels 
because they may be Maxwell glasses. 

Distinguishing glass-like and gel-like responses
of hardened condensates is not only
conceptually but also functionally essential. Both
gels and glasses can be structurally stable. 
Gel stiffness can be actively regulated by the 
degree of cross-linking and can be tailored 
to sustain different magnitudes of forces. A 
glass can act as a mechanical sensor, just like 
liquid droplets (7), because it can flow under 
stress. A jammed glass only allows small mol- 
ecules to pass, whereas the larger pores of a 
gel permit diffusion of macromolecules such 
as proteins. However, glasses are more easily
fluidized. A gel would be more desirable
for a condensate where structural rigidity and
chemical reactions are both needed, such as 
for centrosomes (4). A glass is suitable for 
slowing down all macromolecule movement 
through jamming and allowing small mol- 
ecules to pass through and quickly fluidize 
the content when needed, such as for stress- 
sensing condensates (3). 

It will be exciting to see which nondy- 
namic condensates in cells are gels and 
which ones are glasses, and how cells ex- 
ploit their material properties for functions. 
However, unlike the in vitro results reported 
by Jawerth et al., directly probing rheologi- 
cal properties of endogenous condensates 
remains technically challenging. Tools for 
forming de novo condensates in live cells in 
a controlled manner may be useful for en- 
gineering condensates suitable for optical 
tweezer manipulation or embedding micro- 
beads to follow condensate rheology (7, 8). 

CORONAVIRUS


Remembering seasonal coronaviruses


Antibodies against seasonal coronaviruses react with SARS-CoV-2


By Jenna J. Guthmiller and Patrick C. Wilson


Severe acute respiratory syndrome
coronavirus 2 (SARS-CoV-2) has dif- 
ferential effects according to age, with 
symptomatic and severe infections 
mostly occurring in older adults. One 
possible explanation for this varia- 
tion is that children and younger adults have 
more preexisting immunity against seasonal 
human coronaviruses (HCoVs) that cross- 
react with SARS-CoV-2, providing protection 
from severe and even symptomatic SARS- 
CoV-2 infection. Consistently, SARS-CoV-2 
cross-reactive memory CD4+ and CD8+ T cells
against the spike protein, the major surface 
protein of coronaviruses, have been reported 
in unexposed individuals (1, 2). Whether hu- 
moral immunity (antibodies and memory B 
cells) against SARS-CoV-2 cross-reacts with 
seasonal HCoVs is now emerging. On page 
1339 of this issue, Ng et al. (3) and Shrock 
et al. (4) reveal that individuals exposed and 
unexposed to SARS-CoV-2 have cross-reactive 
serum antibodies against the spike protein of 
SARS-CoV-2 and seasonal HCoVs. 

Four seasonal HCoV strains cause cold 
symptoms in humans: 229E, NL63, OC43, 
and HKU1. Despite using different host re- 
ceptors for cellular entry, all HCoVs express 
the spike protein on their surface. The spike 
protein is composed of two subunits: S1 con- 
tains the receptor-binding domain (RBD), 
which is responsible for binding to host cell 
receptors, and S2 is critical for mediating vi- 
ral and host cell membrane fusion and cell 
entry. The fusion peptide of the S2 subunit 
is highly conserved among seasonal HCoVs 
and zoonotic coronaviruses, including SARS- 
CoV-2 (3), whereas S1 is more variable. 

Shrock et al. found that people unexposed 
to SARS-CoV-2 have cross-reactive antibody 
responses against an array of coronaviruses, 
including the recently emerged SARS-CoV-2. 
However, most of this preexisting immunity 
targets a few epitopes on the spike protein, 
nucleocapsid protein, and the nonstructural 
proteins that are encoded by open reading 
frame 1. Using a sensitive flow cytometric assay
to detect cross-reactive serum antibodies,
Ng et al. found that individuals unexposed to
SARS-CoV-2 possessed neutralizing antibodies
against the S2 protein. Cross-reactive antibodies
were class-switched to the mature antibody
isotypes immunoglobulin G (IgG) and
IgA, suggesting that B cells producing these 
cross-reactive antibodies were induced by a 
previous immune response against infection 
with seasonal HCoVs. 

Upon SARS-CoV-2 infection, individuals 
showed increased production of antibod- 
ies that cross-react with the spike proteins 
of SARS-CoV-2 and seasonal HCoVs, called 
back-boosting (see the figure). Back-boosting 
is a common phenomenon observed after 
influenza virus infection and vaccination 
that results in the recall of antibodies target- 
ing conserved epitopes of past circulating 
influenza viruses (5). Back-boosting can be 
directed to nonprotective epitopes, such as 
those on the unexposed nucleocapsid protein, 
which is referred to as original antigenic sin. 
However, back-boosting can lead to the recall 
of broadly neutralizing antibodies, as observed
with the 2009 pandemic H1N1 influenza
virus (6). Shrock et al. and Ng et al. both
identified that SARS-CoV-2 infection boosted 
antibody responses against several conserved 
epitopes, including the fusion peptide of the 
S2 subunit. Together, these studies solidify 
the existence of cross-reactive humoral im- 
munity against HCoVs and SARS-CoV-2. 

Many of the cross-reactive antibodies are 
specific for epitopes on the fusion peptide 
or nearby epitopes on the S2 subunit. These 
antibodies likely neutralize coronaviruses by 
blocking viral membrane fusion and host 
cell entry (7). Although Ng et al. showed 
that cross-reactive antibodies neutralized 
SARS-CoV-2 infection in vitro, it is unknown 
whether cross-reactive antibodies specific 
for the fusion peptide prevent SARS-CoV-2 
infection or limit COVID-19 severity in vivo. 
Nonetheless, because the fusion peptide is 
highly conserved across coronaviruses (8), 
it is an attractive target for a universal coro- 
navirus vaccine that could generate broadly 
neutralizing antibodies and thereby protect 
against seasonal HCoVs, SARS-CoV-2, and fu- 
ture zoonotic coronavirus spillovers. 

Most adults have been exposed to and have
preexisting immunity against the seasonal
HCoVs. Infection rates of seasonal HCoVs are
higher in children and adolescents (9), poten- 
tially because children are in close contact in 
childcare settings and have little preexisting 
immunity against seasonal HCoVs. Ng et al. 
found that children and adolescents had the 
highest amounts of cross-reactive antibodies 
between seasonal HCoVs and SARS-CoV-2, 
suggesting that recurrent seasonal HCoV in- 
fections in young people may protect them 
from symptomatic and severe SARS-CoV-2 
infections by limiting viral infection. Adults 
unexposed to SARS-CoV-2 have serum an- 
tibodies against seasonal HCoVs that do not 
cross-react with SARS-CoV-2, suggesting that 
recurrent HCoV infection may be necessary 
to induce and sustain protective cross-reac- 
tive antibody titers. A longitudinal study in 
Michigan found that HCoV-OC43 was the 
most common strain to infect children (9). 
Because the HCoV-OC43 and SARS-CoV-2 
fusion peptides are highly conserved (3, 4), 
antibodies produced from 
HCoV-OC43 infection may 
provide protection against 
SARS-CoV-2 infection in chil- 
dren. Furthermore, S2-specific 
memory B cells commonly 
cross-react between the spike 
protein of OC43 and SARS- 
CoV-2 (10). Thus, prior OC43 
infection may train B cells to- 
ward conserved fusion peptide 
epitopes and subsequent expo- 
sure to OC43 or SARS-CoV-2 
may boost cross-reactivity. 

How preexisting humoral 
immunity against seasonal 
HCoVs affects the develop- 
ment of humoral immunity 
against SARS-CoV-2 remains 
unclear. Although humans 
have cross-reactive memory 
B cells that can be recalled by 
SARS-CoV-2 (3, 10), cross-reac- 
tive antibodies and memory B 
cells against seasonal HCoVs 
may hinder the development 
and maturation of high-affin- 
ity B cells targeting conserved 
epitopes shared between sea- 
sonal HCoVs and SARS-CoV-2. 
Recent work on Plasmodium 
falciparum, the parasite that 
causes malaria, revealed that 
nonprotective antibody ti- 
ters against the circumspo- 
rozoite protein limited the 
recruitment of high-affinity 
B cells against the same epit- 
ope (epitope masking) (11). In 
the context of coronaviruses,
cross-reactive antibodies against the fusion
peptide of seasonal HCoVs may mask this
epitope and limit the recruitment and de-
velopment of high-affinity B cells against the
SARS-CoV-2 fusion peptide. Rather, the hu-
moral immune response may be diverted to-
ward SARS-CoV-2-specific epitopes, such as
those of the RBD. Therefore, the generation
of a universal coronavirus vaccine that boosts
antibodies against the fusion peptide may be
difficult to achieve. With the likely rollout of
SARS-CoV-2 vaccines, it will be interesting to
understand the relationship between cross-
reactive preexisting antibodies before vac-
cination and the subsequent antibody speci-
ficity induced by vaccination, particularly in
children and older people who experience
different severity of disease.

Humoral immunity against coronaviruses
Upon repeated exposure to seasonal human coronaviruses (HCoVs), strain-specific and
cross-reactive antibodies are generated. Upon subsequent exposure to SARS-CoV-2,
cross-reactive antibodies are produced (back-boosted) to mitigate infection.
Cross-reactive memory B cells induced by
seasonal HCoVs will become activated and
differentiate into antibody-secreting plasma
cells that produce antibodies that cross-react
with SARS-CoV-2 and seasonal HCoVs.

Low antibody titers and low-affinity an-
tibodies have the potential to enhance viral
disease by facilitating viral uptake by host
cells, a process called antibody-dependent
enhancement (ADE). ADE is observed with
atypical measles and serial infections with
different dengue virus serotypes (12, 13).
Because preexisting antibody titers against 
SARS-CoV-2 are relatively low and potentially 
low affinity, concern remains that this could 
lead to ADE. However, there is no evidence 
for ADE in SARS-CoV-2 infection, particu- 
larly because thousands of acutely infected 
people have received convalescent plasma 
from recovered COVID-19 patients, with few 
adverse events (14). Additionally, ADE has 
been suggested as a potential mechanism 
leading to multisystem inflammatory syn- 
drome in children (MIS-C), which is associ- 
ated with SARS-CoV-2 infection. However, 
no distinct antibody response was predic- 
tive of MIS-C occurrence (15), although this 
study did not analyze the presence of cross- 
reactive antibodies against seasonal HCoVs. 
Therefore, antibodies that cross-react with 
seasonal HCoVs and SARS-CoV-2 are unlikely 
to worsen COVID-19 and instead are likely to 
provide some protection against SARS-CoV-2 
infection and severe COVID-19. Together, the 
studies of Shrock et al. and Ng et al. highlight 
that further research is needed 
into how SARS-CoV-2 antibody
responses are shaped by
previous exposures to seasonal 
HCoVs and how this immunity 
can be harnessed to provide 
protection. 

VIEWPOINT: COVID-19


Ensuring vaccine safety


Comprehensive safety testing is based on experience with prior vaccines


By David M. Knipe, Ofer Levy,
Katherine A. Fitzgerald, Elke Mühlberger


There is an urgent need for vaccines to
protect against severe acute respiratory
syndrome coronavirus 2 (SARS-CoV-2)
infection to reduce COVID-19
and stop the current pandemic.
Although bureaucratic delays should 
be reduced to accelerate vaccine availability, 
there remains the need for extensive safety 
testing protocols developed by the U.S. Food 
and Drug Administration (FDA) and other 
regulatory agencies. COVID-19 vaccines will 
be safe if regulatory agencies maintain their 
well-documented safety testing protocols. 
Safety should be considered at every phase of 
vaccine discovery, development, and testing. 
History provides a strong scientific basis for 
safety evaluation of all vaccine candidates, 
which must be maintained to realize their 
enormous potential. 

Vaccines are among the most successful 
medical and public health measures ever 
implemented (1). It is estimated that vaccines 
prevent ~6 million deaths globally per year 
(2). For example, vaccination eliminated vari- 
ola virus (which causes smallpox) and nearly 
all wild poliovirus and has greatly reduced 
measles virus infection. Vaccines against hep- 
atitis A and B viruses, rubella virus, mumps 
virus, influenza virus, human papilloma 
virus, varicella zoster virus (which causes 
chickenpox in children or shingles in adults), 
and yellow fever virus are broadly used and 
highly successful at reducing morbidity and 
mortality. The historical experience with vac- 
cine development has paved the way for a 
well-developed path for preclinical and clini- 
cal testing of vaccines to ensure their safety 
and efficacy, leading to safe vaccines that 
have saved millions of lives. 

Empirical experience, including evaluation 
of vaccine-associated adverse events, indi- 
cates the importance of thoroughly assessing 
safety of vaccine preparations before licensing 
and widespread use. Lessons learned include 
that all batches of vaccines must be tested 
for safety. For example, the need for system- 
atic adherence to formulation standards and 
safety testing of each batch of formalin-inactivated 
Salk polio vaccine before deployment 
became apparent after a tragic incident in 
1955. Although nearly all batches of the li- 
censed vaccine were safe, two batches from 
Cutter Laboratories were contaminated with 
live poliovirus because of incomplete inacti- 
vation. This resulted in abortive polio (char- 
acterized by headache, stiff neck, fever, and 
muscle weakness) in 40,000 individuals, 51 
cases of permanent paralysis, and five deaths 
and spread to family and community mem- 
bers (3). The Cutter incident led to strict new 
federal regulations, including current quality 
control measures to ensure strict adherence 
to vaccine formulation for inactivation of 
each vaccine batch. Inactivated polio vaccine 
preparations have been very safe since (4). 

An additional level of vaccine safety is pro- 
vided by understanding the mechanism of 
action and immune correlates of protection. 
For some vaccines, correlates of protection, 
such as the presence of specific neutralizing 
antibodies, are well established. A clear un- 
derstanding of correlates of protection can 
ensure that vaccines induce the optimal im- 
mune response for protection while avoiding 
nonproductive or counterproductive immune 
responses or disease. Two examples illustrate 
this point. A formalin-inactivated respiratory 
syncytial virus (FI-RSV) vaccine developed in 
the 1960s stimulated moderate amounts of 
serum antibodies, measured by complement 
fixation, but failed to protect against RSV in- 
fection or disease (5). Moreover, most children 
who were immunized with this inactivated RSV vaccine 
and subsequently became infected with 
RSV were hospitalized, with enhanced respi- 
ratory disease. The FI-RSV vaccine was later 
found not to induce neutralizing antibodies 
in vaccinees (6), and in a cotton rat infection 
model it induced a T helper cell 2 (TH2)-biased 
CD4+ T cell response (7), which can lead 
to lack of protective immunity and immune 
pathology. Lack of viral neutralization and 
altered T cell responses are believed to have 
contributed to vaccine-enhanced disease in 
these immunized individuals. 

In another example, the first dengue vac- 
cine, a tetravalent chimeric yellow fever- 
dengue virus vaccine, reduced incidence of 
severe dengue disease in older children and 
was licensed for use in children >9 years of 
age. However, although overall reduction 
of severe disease and hospitalization over a 
5-year period was observed in the trials, pro- 
tection was highest in individuals who had 
been exposed to dengue virus before immu- 
nization, whereas immunization increased 
severe disease in seronegative individuals (8, 
9). Protection was lowest against one of the 
four dengue serotypes (DEN-2), suggesting 
that weak immune responses to DEN-2 or in 
general in seronegative individuals resulted 
in antibody-dependent enhancement (ADE) 
of disease. ADE is caused by non-neutralizing 
antibodies or antibodies at subneutralizing 
concentrations that promote infection by 
enhancing uptake of viral particles into host 
cells. ADE has been reported to increase infec- 
tivity of SARS-coronavirus in certain types of 
cells in culture (10). Given these experiences, 
current vaccine evaluation in preclinical and 
clinical studies (see the figure) includes scru- 
tiny of vaccine immunogenicity in relation to 
correlates of protection to maximize efficacy 
and minimize potential detrimental effects. 
Another lesson learned is that if seri- 
ous adverse events are detected in a clini- 
cal trial, then additional clinical testing is 
indicated. For example, the first rotavirus 
vaccine, a rhesus-human reassortant ro- 
tavirus tetravalent vaccine (RRV-TV) with 
genome segments of human and rhesus ro- 
taviruses, was licensed in the United States 
in 1998. Rotaviruses cause severe and po- 
tentially fatal diarrhea in infants and chil- 
dren. Although the vaccine was effective in 
preventing gastroenteritis in infants, intus- 
susception (a painful form of bowel obstruc- 
tion due to bowel prolapse that can be fatal 
if left untreated) was reported in 5 of 10,054 
vaccinees, compared with one case in 4633 
placebo recipients, a difference that was not 
statistically significant. The perceived lack 
of serious adverse effects led to licensure of 
RRV-TV, with intussusception as a possible 
rare adverse reaction. However, after its ap- 
proval and in the first year of vaccine use, 
15 cases of intussusception were reported 
in vaccinees, in contrast to only 4 cases in 
the 7 years preceding vaccination, triggering 
suspension of the vaccine in 1999 (11). This 
example illustrates the importance of care- 
ful evaluation of any adverse reaction and 
postlicensure surveillance to ensure vaccine 
safety. Large prelicensure trials of two later 
rotavirus vaccines, a pentavalent human- 
bovine viral reassortant vaccine (RV5) and 
a monovalent single-strain human rotavirus 
vaccine (RV1) with compositions different 
from that of RRV-TV, demonstrated very low 
incidence of intussusception, and these vaccines 
are now widely approved for use. 

Figure: Vaccine safety evaluation. Safety is considered at every phase of vaccine discovery and development. Upon licensure, vaccines enter phase 4, whereby surveillance approaches by regulators, such as the U.S. Food and Drug Administration (FDA), monitor potential vaccine side effects. 
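
The RRV-TV prelicensure counts quoted above (5 cases among 10,054 vaccinees versus 1 case among 4633 placebo recipients) can be checked with a short calculation. The sketch below is illustrative only: the text does not say which statistical test was used in the trial analysis, so a two-sided Fisher's exact test is assumed here, and the function name `fisher_exact_two_sided` is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2

    def prob(k: int) -> float:
        # Hypergeometric probability of k events in row 1 with margins fixed.
        return comb(row1, k) * comb(row2, col1 - k) / comb(total, col1)

    observed = prob(a)
    k_min = max(0, col1 - row2)
    k_max = min(col1, row1)
    # Sum over every table that is no more probable than the observed one.
    return sum(
        p
        for p in (prob(k) for k in range(k_min, k_max + 1))
        if p <= observed * (1 + 1e-9)
    )


# Counts quoted in the text: (intussusception cases, participants without it).
vaccinees = (5, 10_054 - 5)
placebo = (1, 4_633 - 1)
p_value = fisher_exact_two_sided(*vaccinees, *placebo)
print(f"two-sided p = {p_value:.2f}")
```

Under this assumption the p-value is well above the conventional 0.05 threshold, consistent with the statement that the prelicensure difference was not statistically significant. With only six events in total, a trial of this size has little power to detect a rare adverse event, which is why postlicensure surveillance mattered.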

In 1976, an outbreak of influenza that re- 
sulted in one death among Army recruits 
at Fort Dix, New Jersey, followed by spread 
through the base, was ascribed to influenza 
virus isolates similar to the 1918 pandemic 
H1N1 virus that killed millions of people. 
Fearful that a potentially pandemic influ- 
enza virus was emerging, an emergency vac- 
cine initiative was approved by President 
Ford and funded by Congress. Meanwhile, 
scientific questions arose, including genetic 
evidence that the New Jersey isolates were 
similar to swine (H1N1) influenza viruses circulating 
in the United States at the time (12) 
and doubts about the virulence of the virus 
isolates, both suggesting that this was not an 
emerging pandemic virus. In addition, there 
was evidence that the vaccine being pro- 
duced lacked an antigenic form of the viral 
neuraminidase protein, which could render it 
less efficacious. President Ford nevertheless 
implemented the vaccine initiative. 

The influenza vaccine was tested in ~7000 
individuals in the spring of 1976 and was 
deemed to be safe, with broad immuniza- 
tion starting on 1 October. About 25% of 
the U.S. population was immunized before 
~450 cases of the paralytic Guillain-Barré 
syndrome (GBS) disease emerged, a statisti- 
cally significant increase above the normal 
population incidence (13). This influenza 
immunization program was terminated in 
December 1976 with severe consequences, in- 
cluding diminished public confidence in vac- 
cines and the public health care system. Later 
studies have shown minimal GBS associated 
with other influenza virus vaccines (13); thus, 
this particular vaccine formulation was likely 
problematic. Potential lack of neuraminidase 
activity, contamination with other microbes, 
and induction of autoimmune antibodies 
have all been suggested to account for these 
adverse effects. 

As a result of the observations from the 
RRV-TV and 1976 swine flu vaccines, current 
regulatory practices require monitoring of 
rare adverse events pre- and postlicensure 
with detection of serious adverse events trig- 
gering a pause to trial or use. Such a safety 
pause enables study of adverse events to as- 
sess whether the vaccine trial or use can be 
resumed as deemed appropriate. Indeed, 
such safety pauses have occurred in current 
COVID-19 vaccine trials, underscoring the 
value of extensive regulatory safety protocols, 
which should not be rushed or undermined. 

History has also taught us the impor- 
tance of continued surveillance of potential 
vaccine-related adverse events and measure- 
ment of immunogenicity and outcomes even 
after licensure. Such surveillance—for exam- 
ple, through the U.S. Department of Health 
and Human Services Vaccine Adverse Event 
Reporting System (VAERS) and the FDA 
Biologics Effectiveness and Safety (BEST) 
electronic medical record-based platform— 
can confirm safety and increase understand- 
ing of the immune correlates of protection 
and mechanisms of immunogenicity to fur- 
ther enhance vaccine development. 

Multiple COVID-19 vaccine trials are cur- 
rently being conducted in parallel, with many 
additional candidates in preclinical develop- 
ment. A decision regarding one of these clini- 
cal trials that enables vaccine use before full 
regulatory approval—for example, through 
emergency use authorization (EUA)—would 
accelerate deployment at the potential risk of 
undermining the ability to recruit additional 
participants for that and other trials, thereby 
impairing collection of normally comprehen- 
sive data regarding safety, immunogenicity, 
efficacy, and durability of protection (14). To 
enhance public confidence in vaccines by 
providing transparency, the FDA Center for 
Biologics Evaluation and Research (CBER) 
has published master protocols for SARS-CoV-2 
vaccine safety and effectiveness evaluation 
(15). Overall, the interests of vaccinees, 
vaccine developers, pharmaceutical compa- 
nies, and regulatory agencies are aligned on 
the importance of ensuring vaccine safety. 

There is an urgent need for COVID-19 vac- 
cines and exciting progress to that end, but 
there remains a critical public health obliga- 
tion to conduct rigorous evaluation to ensure 
safety as well as efficacy. Vaccines remain one 
of the most successful biomedical tools for 
prevention of disease. The urgent need for 
COVID-19 vaccines must be balanced with 
the imperative of ensuring safety and pub- 
lic confidence in vaccines by following the 
established clinical safety testing protocols 
throughout vaccine development, including 
both pre- and postdeployment. 

RETROSPECTIVE 


Angelika Amon (1967-2020) 


Trailblazing cell cycle biologist 


By Jason M. Sheltzer and Rosella Visintin 


Angelika Amon, a pioneering geneticist 
and cell biologist, died on 29 October 
after a long battle with ovarian cancer. 
She was 53 years old. Angelika made 
important discoveries involving the 
control of chromosome segregation 
and the consequences of segregation errors. 
An innovative and outspoken scientist, she 
thrived on challenging dogmas and tackling 
unorthodox questions. 

Born in Austria on 10 January 1967, 
Angelika first encountered the two loves of 
her life in high school. There, she met her 
future husband, Johannes Weis, who would 
become her constant companion for al- 
most 40 years and the father of her beloved 
daughters, Theresa and Clara. In a science 
class, she also came across a grainy, black- 
and-white movie of chromosomes separating 
during mitosis. She would later identify that 
movie as the catalyst that spurred her to a 
lifetime of studying cell cycle genetics. 

Angelika began her scientific career as a 
student in the laboratory of Kim Nasmyth 
at the Research Institute of Molecular 
Pathology in Vienna, where she made several 
fundamental discoveries illuminating how 
cyclins drive the yeast cell cycle. She showed 
that cyclin expression is carefully confined 
to discrete phases of the cell cycle by a com- 
bination of transcriptional and posttrans- 
lational regulation, which help to ensure 
proper progression through mitosis. After 
receiving her B.S. in 1989 and her Ph.D. in 
1993, both from the University of Vienna, she 
joined Ruth Lehmann’s lab at the Whitehead 
Institute in Cambridge, Massachusetts, for 
a brief postdoc on Drosophila embryogen- 
esis. Angelika soon realized that flies grew 
too slowly for her liking, and she left the 
Lehmann lab to begin her independent ca- 
reer as a Whitehead Fellow in 1996. 

In her own lab, Angelika decided to re- 
turn to yeast as a model organism for cell 
cycle biology. She was fond of proclaiming, 
“When you work with yeast, the rate-limiting 
factor is your brain.” She believed that the 
awesome power of yeast genetics gave scientists 
the ability to design clever experiments 
to uncover the secrets of the cell cycle 
and that these fundamental processes were 
likely to be conserved across the tree of life. 
One of us (R.V.) joined Angelika’s lab as her 
first postdoc, and we embarked on an in- 
credible scientific journey. Together, we dis- 
covered that exit from mitosis requires the 
phosphatase Cdc14, which is released from 
the nucleolus during anaphase. It was the 
first time that a phosphatase was shown to 
link mitosis and the subsequent G1 phase, 
and it established a new paradigm for how 
cell cycle-driven phosphorylation is regu- 
lated and reset after mitosis. 







In 1999, Angelika became a faculty member 
at the Massachusetts Institute of Technology 
Center for Cancer Research (now the Koch 
Institute for Integrative Cancer Research), 
and she quickly secured an appointment 
as an investigator for the Howard Hughes 
Medical Institute. There, her lab continued to 
explore cell cycle control, and, among many 
seminal discoveries, she identified the ge- 
netic pathways that regulate chromosomal 
cohesion during both mitosis and meiosis. 
She also turned her attention to a new sci- 
entific problem: aneuploidy, or the conse- 
quences of chromosome segregation errors. 
For many years, it had been appreciated that 
the dysregulation of the cell cycle can lead to 
the generation of aneuploid progeny and that 
most human tumors are aneuploid. However, 
no scientist had ever attempted to system- 
atically study aneuploid cells or understand 
how they are linked to tumorigenesis. As 
she did in so many cases, Angelika decided 
to push past what was known and directly 
interrogate the consequences of aneuploidy. 

By using a series of clever genetic tricks, 
Angelika and the members of her lab created 
a set of yeast (and eventually, mouse) cells 
that had defined chromosomal aneuploidies. 
They discovered that aneuploidy of differ- 
ent chromosomes causes a stereotypical set 
of physiological changes, including altered 
proliferation, transcriptional deregulation, 
genomic instability, and metabolic abnor- 
malities. One of us (J.M.S.) joined Angelika’s 
lab as a graduate student to study the 
role of aneuploidy in tumor development. 
Unexpectedly, we found that aneuploid cells 
are resistant to oncogenic transformation, 
demonstrating that aneuploidy, rather than 
being just an early event in tumorigenesis, 
is an anticancer barrier that cells must over- 
come. Angelika’s work on aneuploidy helped 
establish a new field of biological research, 
and her insights into the consequences of 
aneuploidy have spurred a widespread effort 
to identify aneuploidy-targeting compounds 
that could potentially be used to treat aneu- 
ploid cancers. 

Angelika’s scientific achievements were 
recognized by many awards and honors, in- 
cluding the National Science Foundation’s 
Alan T. Waterman Award (the highest 
American honor for a scientist under 40), 
the Paul Marks Prize for Cancer Research, 
the Vanderbilt Prize in Biomedical Science, 
and the Breakthrough Prize in Life Sciences. 
She was elected to the National Academy of 
Sciences in 2010 and the American Academy 
of Arts and Sciences in 2017. Undoubtedly, 
she would have had many more discoveries 
ahead of her; on the day of her death, she 
published a preprint on bioRxiv that estab- 
lishes a link between cytoplasmic volume 
and stem cell potential. 

The life of every party, Angelika always 
had a brilliant story to tell, good advice to 
give, juicy gossip to loudly whisper, or a fas- 
cinating question to pose. In her lab, she 
brought together researchers from all over 
the globe and turned them into a surrogate 
family. She was a fierce defender of each of 
her students and was an especially strong 
advocate for the careers of the women whom 
she trained. 

In her final weeks, Angelika remained a 
committed mentor and scientist. She contin- 
ued to lead group meetings and advise her 
students from the confines of her hospital 
bed. As her current lab members wrote to 
her a few hours before her death, “Do not 
worry about us! You have equipped us with 
everything we need to succeed.” While the 
scientific world will surely miss Angelika’s 
joyful, boisterous presence, it is much richer 
for her remarkable discoveries and for the 
generations of scientists whose lives she in- 
exorably changed. 

10.1126/science.abf7124 

ETHICS: COVID-19 


COVID-19 vaccine trial ethics 
once we have efficacious vaccines 


Some placebo-controlled trials can continue ethically 
after a candidate vaccine is found to be safe and efficacious 


By David Wendler, Jorge Ochoa, Joseph 
Millum, Christine Grady, Holly A. Taylor 


The unprecedented effort to identify 
one or more safe and effective vaccines 
for COVID-19 includes more than 180 
candidates in development (1), with at 
least 12 in phase 3 trials (2). The testing 
of so many vaccine candidates, in 
a pandemic of a disease for which there are 
to date limited treatment options, raises a 
critical challenge: What should researchers 
do if a vaccine candidate is judged to be safe 
and efficacious? Guidance from the U.S. Food 
and Drug Administration (FDA) states that 
in the event that a COVID-19 vaccine candidate 
is judged to be “safe and effective,” discussion 
may be necessary “to address ethical 
arguments to break the blind and offer vac- 
cine to placebo recipients” (3). We consider 
here two questions raised by this guidance: 
First, if a vaccine candidate is found to be 
safe and efficacious in a placebo-controlled 
trial, should the researchers continue that 
trial as designed? Second, should research- 
ers continue to test other vaccine candidates 
using placebo-controlled trials? These two 
questions are especially timely given recent 
announcements by Pfizer and Moderna that 
their vaccine candidates have been found 
to be efficacious in preventing symptomatic 
COVID-19 (4, 5). 


VACCINE SAFETY AND EFFICACY 

A finding of efficacy occurs when a vaccine 
candidate being tested in a phase 3 trial meets 
its efficacy end point, either during an in- 
terim analysis or after study completion. The 
primary efficacy end point of current phase 
3 trials is whether the vaccine candidate pre- 
vents symptomatic COVID-19 in individuals 
newly infected with severe acute respiratory 
syndrome coronavirus 2 (SARS-CoV-2) (6). A 
finding of efficacy in these trials thus implies 
that the vaccine candidate protects infected 
individuals from experiencing symptoms. A 
finding of safety occurs when there is suffi- 
cient data on the impact of the vaccine can- 
didate to conclude that its potential benefits 
justify its risks in the target population. 

A single finding of safety and efficacy 
may not be sufficient for a vaccine candi- 
date to receive FDA approval. Depending on 
the strength of the data, it may be impor- 
tant to conduct additional research to learn 
more about the vaccine candidate and gain 
greater confidence in its safety and effi- 
cacy before offering it to millions of people. 
For example, the FDA frequently requires 
a finding of efficacy in two phase 3 trials 
before approving medical interventions for 
marketing to the public. 

Still, a finding of safety and efficacy in a 
phase 3 trial is an ethically critical point in 
the vaccine development process. From that 
point, receiving the vaccine candidate is 
known to offer protection to many individu- 
als. Hence, receiving a placebo instead is con- 
trary to their interests, even though it may be 
necessary for collecting socially valuable data. 

This tension between protecting research 
participants and gathering data that may 
benefit others is common to clinical research, 
and there is substantial literature on how to 
address it (7). Although this literature applies 
to the ethics of vaccine trials, the present sit- 
uation is unparalleled. 

There is a global crisis for which there are 
currently limited treatment options. Hence, 
billions of individuals could benefit from a 
safe and efficacious vaccine. And having so 
many vaccine candidates in development 
substantially increases the chances that 
one or more will be found to be safe and ef- 
ficacious while others are still being tested. 
Limitations on current treatment options 
mean that it is in each individual’s interests 
to receive the first vaccine found to be safe 
and efficacious, rather than participate in 
vaccine trials where they might receive pla- 
cebo or an unproven vaccine candidate. 

In addition, given the challenges of man- 
ufacturing sufficient supply for so many 
people, and given that different vaccine candidates 
may be efficacious in different populations, 
several vaccines may be needed to 
meet the global need. This highlights the po- 
tential social value of conducting additional 
trials after one or more vaccine candidates 
are found to be safe and efficacious. 

Moreover, vaccine candidates typically 
have not been widely available until after 
they receive marketing approval by the FDA. 
However, the FDA has indicated that it may 
take the unprecedented step of making a 
vaccine candidate that is found to be safe 
and efficacious widely available through an 
emergency use authorization (EUA) before 
marketing approval. An EUA involves the 
FDA commissioner allowing unapproved 
medical products to be used in an emergency 
to diagnose, treat, or prevent serious or life- 
threatening diseases or conditions “when 
there are no adequate, approved, and avail- 
able alternatives” (8). 


PARTICIPANTS, NOT PATIENTS 

Pfizer recently announced that at the first 
interim analysis, its vaccine candidate was 
found to be 95% effective at preventing 
symptomatic COVID-19 (4, 5). A week later, 
Moderna announced that its vaccine can- 
didate was found to be 94.5% effective. At 
that point, the trials had enrolled more than 
43,000 and 30,000 participants, respectively, 
of whom 94 and 95 developed symptomatic 
disease. If these findings are confirmed, they 
will represent an important public health 
advance, and provision of the vaccine candi- 
dates would be in the interests of many in- 
dividuals. The FDA may thus grant an EUA, 
making either or both vaccines available be- 
fore each study’s completion and also before 
FDA marketing approval. 
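
As a rough illustration of how an efficacy figure of this kind arises, the sketch below computes vaccine efficacy as one minus the ratio of attack rates in the vaccine and placebo arms. The split of cases between arms and the arm sizes are hypothetical values chosen for illustration; they are not given in the text, and the trials' actual analyses use more detailed methods (for example, accounting for follow-up time). The function name `vaccine_efficacy` is likewise an assumption.

```python
def vaccine_efficacy(
    cases_vaccine: int, n_vaccine: int, cases_placebo: int, n_placebo: int
) -> float:
    """Efficacy as 1 - (attack rate in vaccine arm / attack rate in placebo arm)."""
    attack_vaccine = cases_vaccine / n_vaccine
    attack_placebo = cases_placebo / n_placebo
    return 1.0 - attack_vaccine / attack_placebo


# Hypothetical example: 95 symptomatic cases split 5 (vaccine) vs. 90 (placebo),
# with two equal arms of 15,000 participants each (assumed, not from the text).
ve = vaccine_efficacy(5, 15_000, 90, 15_000)
print(f"estimated efficacy: {ve:.1%}")
```

With equal arm sizes the estimate depends only on the ratio of case counts, which is why a finding of high efficacy can rest on fewer than 100 cases among tens of thousands of participants, and why continued follow-up can tighten the estimate.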

Some commentators argue that clinical 
trials are ethically acceptable only as long 
as there is insufficient evidence that the in- 
tervention offered in one arm is superior to 
what is offered in another arm, or to what is 
available outside the trial (9). This view im- 
plies that it would be unethical to continue 
Pfizer’s and Moderna’s placebo-controlled tri- 
als given the findings of efficacy. It also im- 
plies it would be unethical to test other un- 
proven vaccine candidates against placebo. 

Proponents defend this view on the 
grounds that it provides a way to protect 
participants from excessive risks. They argue 
further that researchers conducting clinical 
trials are obligated to treat participants con- 
sistent with their clinical interests and con- 
clude that it is unethical to give participants 
a placebo once a safe and efficacious vaccine 
has been identified. 

We disagree. This view fails to recognize 
that the obligations researchers have to 
their participants are distinct from the obligations 
that clinicians have to their patients 
(9). It further ignores the existence of other 
mechanisms, including independent review 
and informed consent, to protect research 
participants from excessive risks. The dif- 
ferences between the ethics of clinical re- 
search and clinical care are reflected in the 
consensus that it can be ethically appropri- 
ate to invite research participants to accept 
some risks to collect socially valuable data. 
In particular, codes and guidelines around 
the world permit researchers to expose par- 
ticipants in clinical trials, including vaccine 
trials, to some risks to collect socially valu- 
able data that cannot be obtained in a less 
risky way (10). 

These guidelines have a critical impli- 
cation for COVID-19 vaccine trial ethics: 
Researchers are not obligated to pro- 
vide placebo recipients with a safe and 
efficacious vaccine once one has been 
identified. Instead, researchers are ob- 
ligated to ensure that any plans to con- 
duct placebo-controlled trials remain 
ethically appropriate given current ev- 
idence. Such plans must be reviewed 
and approved by an independent 
board, known as an institutional re- 
view board (IRB) in the United States, 
which must find that a number of conditions 
are satisfied (11). Two of these 
conditions are particularly important 
for determining the acceptability of 
conducting placebo-controlled trials 
after a vaccine candidate has been 
found to be safe and efficacious: Is the 
trial’s risk-benefit profile still accept- 
able? Do participants consent? 


ACCEPTABLE RISK-BENEFIT PROFILE 
Continuing a blinded, placebo-con- 
trolled trial after the vaccine can- 
didate being tested has been found 
to be safe and efficacious is justified 
only when the trial’s risk-benefit 
profile remains acceptable. Making this 
determination requires answering three 
questions: (i) What is the social value of 
continuing the trial? (ii) What are the risks 
to participants of continuing the trial? (iii) 
Are the risks to participants sufficiently low 
and justified by the social value of continu- 
ing the trial? 


Social value 

Continuing a trial after the vaccine candi- 
date has been found to be safe and effica- 
cious can provide an opportunity to collect 
several types of socially valuable data. Of 
greatest importance, it can provide greater 
confidence in and a more precise point es- 
timate of the vaccine’s efficacy and offer 
an opportunity to collect additional safety 
data, including data on any uncommon or 
delayed side effects. It can also help to as- 
sess how long the vaccine’s protective effect 
lasts; offer insight into the vaccine’s impact 
in various subgroups, such as older individ- 
uals or those with comorbidities; and evalu- 
ate whether the vaccine candidate protects 
against infection itself. 

1278 11 DECEMBER 2020 • VOL 370 ISSUE 6522 


Risks to participants 

Once a vaccine candidate is found to be ef- 
ficacious, participants in the placebo arm 
are known to be at higher risk of symptom- 
atic disease than those in the active arm. 
The degree of risk depends on the chances 
that participants in the placebo arm will 
become infected, the risks they face if they 
are, and how much protection the vaccine 
candidate offers. 


The chances that participants in the placebo 
arm will be infected depend on the 
local transmission rate, preventive measures 
they adopt, and the amount of time they 
remain on placebo. When participants are 
on placebo for a short time, the chances of 
infection are correspondingly low. For exam- 
ple, Moderna plans to continue its placebo- 
controlled trial until 151 cases of symptom- 
atic disease are identified (5). Given current 
transmission rates, this may take as little as 
a few weeks. 

Remaining on placebo for a few weeks, 
rather than accessing an efficacious vac- 
cine, poses a low chance of substantial 
harm. Continuing on placebo for even lon- 
ger periods also poses a low chance of sub- 
stantial harm to individuals at low risk for 
severe disease. 

Remaining on placebo for an extended 
period can pose considerable risks to indi- 
viduals at high risk of severe disease. The 
extent of these risks depends critically on 
what options are available to them. In the 
setting of few effective treatments and po- 
tentially strained hospital systems, receiv- 
ing placebo for an extended period rather 
than a safe and efficacious vaccine can 
pose substantial risks. However, if high- 
risk individuals would not have access to a 
safe and efficacious vaccine outside of re- 
search—for example, there is only enough 
supply for the trial or they are not part of a 
prioritized group that will receive the vac- 
cine during the time of the trial—receiving 
placebo in a clinical trial poses few addi- 
tional risks to them. 





A volunteer receives an injection as part of a phase 3 trial of the vaccine candidate developed by Pfizer and BioNTech. 


Are risks sufficiently low and justified? 
There is no algorithm for determining how 
much social value a given clinical trial has 
and whether its social value justifies the 
risks participants face. As a result, IRBs 
tend to focus on ensuring that a trial has 
the potential to collect important data and 
that the risks of substantial harm are low. 
Trials with the potential to collect data 
helpful for addressing a global pandemic 
have considerable social value. Inviting 
competent adults to participate in such 
trials can be ethical when doing so poses 
a small increase in their risk of experienc- 
ing substantial harm. This suggests that it 
can be ethically acceptable to continue a 
placebo-controlled trial for a short period 
after the vaccine candidate has been found 
to be safe and efficacious, even when participants 
might be able to access the vaccine 
candidate outside the trial, for example, 
through an EUA (12). By contrast, if 
continuing the trial does not offer the op- 
portunity to collect socially valuable data, 
or comparable data can be obtained in less 
risky ways, continuing the trial with a pla- 
cebo arm for any length of time would be 
ethically problematic. 

Inviting participants who are at low risk 
of severe disease to remain blinded and stay 
in the trial for a longer period can be ac- 
ceptable when it offers the potential to col- 
lect data that might be helpful for address- 
ing the pandemic. In most cases, continuing 
a blinded, placebo-controlled design with 
high-risk individuals for longer periods will 
not yield data of sufficient value to justify 
it. Exceptions might include when the indi- 
viduals cannot access an efficacious vaccine 
outside the trial and their participation is 
needed to collect valuable data, or they are 
in a group for whom no efficacious vaccine 
candidate has been identified. Otherwise, 
individuals at high risk of severe disease 
should be unblinded and those on the pla- 
cebo arm offered the vaccine within a re- 
designed study or given the opportunity to 
seek the vaccine outside the trial. 

When the value of the data to be collected 
does not justify the risks of continuing the 
trial as designed, researchers have several 
options. They can unblind participants; of- 
fer placebo recipients the vaccine, possibly 
as part of an expanded access program; and 
follow them to collect additional data (13).
Alternatively, researchers might redesign 
the trial, for example, to include a crossover 
in which the blind is maintained and those 
on the placebo arm receive the vaccine af- 
ter they complete the placebo arm. Finally, 
in some cases, it may make sense to simply 
stop the trial and unblind participants, thus 
allowing those in the placebo arm to seek 
the vaccine elsewhere. 


OTHER TRIALS 
Even when a vaccine candidate is found to 
be safe and efficacious, there are likely to be 
good reasons to study others. Another vac- 
cine candidate might be more effective, gen- 
erate longer-lasting immunity, work better 
in certain subpopulations, provide greater 
protection against severe disease, or pre- 
vent infection better. Other candidates may 
also be superior with respect to cost or other 
practical considerations. For example, Pfiz- 
er’s vaccine candidate must be stored at very 
low temperatures until 5 days before admin- 
istration, a requirement which will reduce its 
availability in many parts of the world. 
When there are safe and effective inter- 
ventions available, the default is to use them 
rather than placebo as the comparator in 
clinical trials. Thus, if the Pfizer or Moderna
vaccine candidates, or others, are confirmed 
to be safe and efficacious, researchers should 
consider whether their trials can be rede- 
signed as a comparison between the vaccine 
candidate they are testing and a safe and effi- 
cacious vaccine. When making this decision, 
researchers and IRBs should take into ac- 
count that an active comparison trial is likely 
to require larger sample sizes and extend the 
duration of the trial. 

Redesigning ongoing trials to include an 
efficacious vaccine as an active comparator 
may not always be feasible or valuable. If 
continuing to evaluate the vaccine candi- 
date nonetheless offers important social 
value, participants at low risk of severe 
disease might be invited to stay blinded 
and remain in the placebo-controlled trial. 
If continued participation appreciably in- 
creases participants’ risk of severe disease, 
researchers should discuss participants’ op- 
tions for obtaining an efficacious vaccine 
outside research. 


DISCLOSURE, UNBLINDING, CONSENT 
Researchers are ethically obligated to inform 
participants of developments that might influence
their willingness to remain in a clinical
trial. Clearly, the finding that a vaccine candidate
is safe and efficacious meets
this standard. Hence, investigators should
inform participants in all trials of such a 
finding. This information should include the 
vaccine’s safety record, the level of protection 
it provides, the populations for which it has 
been found to be safe and efficacious, and 
whether it might be available through an 
EUA or other means. 

Researchers should then explain the plans 
for their trial going forward and solicit par- 
ticipants’ consent. To minimize potential 
confusion and distrust (14), participants
should understand the rationale behind the 
plan and that their prior consent does not 
obligate them to continue to participate. If 
researchers intend to continue with a pla- 
cebo-controlled design, participants can be 
encouraged to remain in the trial. However, 
once a safe and efficacious vaccine candi- 
date is available, knowing whether they re- 
ceived placebo or vaccine becomes relevant 
to a participant’s decision whether to seek 
the vaccine outside research. Hence, partici- 
pants who choose to leave the trial should be 
informed whether they received the vaccine 
candidate or placebo. 

If one or more efficacious vaccines become 
widely available outside research, continu- 
ing a blinded, placebo-controlled trial might 
result in participants in both the active and 
placebo arms dropping out and seeking the 
vaccine elsewhere. Researchers should an- 
ticipate this possibility and consider whether 
they might unblind participants, offer an efficacious
vaccine to the placebo recipients,
and follow everyone. Alternatively, it might 
make sense to continue the blinded, placebo- 
controlled trial with a guarantee that indi- 
viduals in the placebo arm will receive an 
efficacious vaccine once their participation 
in the study is completed. With careful plan- 
ning and systematic assessment of the social 
value and risks, we believe it can be ethical 
to conduct some placebo-controlled trials of 
vaccine candidates for COVID-19 even after 
we have an efficacious vaccine. Doing so may 
be necessary to effectively address a pan- 
demic that is causing so much harm around 
the world. 

Unlike this composite image of galaxies visualized by NASA's Galaxy Evolution Explorer (GALEX) spacecraft, the Universe is not boxed in, argues Barbour.


An outside-the-box take on time 


A physicist’s provocative theory offers an optimistic view of our cosmic destiny 


By Matthew Johnson 


The history of the Universe thus far has
certainly been eventful, marked by 
the primordial forging of the light ele- 
ments, the birth of the first stars and 
their violent deaths, and the improb- 
able origin of life on Earth. But will 
the excitement continue, or are we headed 
toward the ultimate mundanity of equilib- 
rium in a so-called heat death? In The Janus 
Point, Julian Barbour takes on this and other 
fundamental questions, offering 
the reader a new perspective—il- 
lustrated with lucid examples and 
poetically constructed prose—on 
how the Universe started (or more 
precisely, how it did not start) and 
where it may be headed. This book 
is an engaging read, which both 
taught me something new about 
meat-and-potatoes physics and re- 
minded me why asking fundamen- 
tal questions can be so fun. 
Barbour argues that there is no 
beginning of time. The Big Bang, 
he maintains, was just a very special configu- 
ration of the Universe’s fundamental build- 
ing blocks, a shape he calls the Janus point. 
As we move away from this point, the shape
changes, marking the passage of time. The 
“future,” he argues, lies in both directions, 
hence the reference to Janus, the two-faced 
Roman god of beginnings and transitions.

The Janus Point: A New Theory of Time
Julian Barbour
Basic Books, 2020.

Barbour illustrates his main points with a 
deceptively simple model known as the three- 
body problem, wherein three masses are sub- 
ject to mutual gravitational attraction. In this 
context, the Janus point occurs when all three 
masses momentarily occupy the same point, 
in what is called a total collision. The special 
shape at the Janus point, explains Barbour, is 
an equilateral triangle, which is his model’s 
version of the Big Bang. I found this imagery 
helpful when trying to understand the more 
abstract, and necessarily less tech- 
nical, application of this concept to 
general relativity. 

One of the main thrusts of Bar- 
bour’s proposal is an attempt to 
reconcile cosmological evolution 
with the second law of thermody- 
namics and an arrow of time. As 
a reminder, the second law is the 
one that tells us that systems tend 
toward increasing entropy, some- 
times colloquially referred to as 
disorder. In a closed system, if we 
begin with order (low entropy), 
then disorder will grow until the entropy 
reaches a maximum. The change in entropy 
can be used to define past (low entropy) and 
future (high entropy). 

When applied to the Universe, the second 
law of thermodynamics evokes a number of 
questions. How did the Universe come to be 
in a state of low entropy, for example? Bar- 
bour’s answer to this is the Janus point. In 
the three-body problem, the Janus point (i.e., 
a total collision) is an inevitability so long 
as the total angular momentum vanishes. 
Recall that angular momentum is among
the sacred conserved quantities in classi- 
cal mechanics, so vanishing at one time 
implies vanishing for all times. Barbour ar- 
gues by analogy that the Universe, similarly 
restricted, must have a Janus point too. If 
true, the seemingly special configuration at 
the Janus point is not so special at all but in 
fact required by a conservation law. 

So, what happens when entropy reaches a 
maximum? Barbour dodges this question by 
arguing that entropy is not the right factor to 
consider and that we should instead look to 
complexity. With time, he argues, the build- 
ing blocks of the Universe can arrange them- 
selves into increasingly complex structures. 

Barbour does not buy the argument that 
the Universe is akin to a closed system whose 
entropy can reach a maximum and instead 
takes the optimistic outlook that complexity 
has no bound and our cosmic party can go on 
forever. But could this be so? The observed 
accelerated expansion of the Universe, the 
most likely explanation of which is a cosmo- 
logical constant, suggests otherwise. A Uni- 
verse with a cosmological constant is one that 
has event horizons—points of no return. And 
just as Stephen Hawking showed with black 
holes, the event horizon has a temperature 
and forms what can be thought of as a sort of 
box. Whether this means we should think of 
the Universe as a closed system remains un- 
clear, but if that is the case, the entropy of the 
system will eventually be maximized. Change 
will cease, save for fleeting departures from 
equilibrium, and mundanity will prevail. Let 
us hope that Barbour is right. 

10.1126/science.abf2440 

Social capital’s value in the courtroom


Inequalities are reinforced during court proceedings and deliberations, finds a sociologist 


By Louise Seamster 


We are used to thinking about racial
disparities in policing and
arrests and disparities in sen- 
tencing and imprisonment, all 
of which produce pathways with 
multiple off-ramps for white and 
wealthy defendants. In his new book, Priv- 
ilege and Punishment, sociologist Matthew 
Clair probes a less-examined period that 
exists between these two stages: the ac- 
tual court proceedings and deliberations. 
Synthesizing knowledge gained from court 
watching, interviews with defendants and 
lawyers, and shadowing three public de- 
fenders in the Boston area, Clair zeroes in 
on this interval—when defendants’ expe- 
riences should theoretically be relatively 
equal—to look at the subtle forms in which 
inequality can manifest. 

Many treatments of racial inequality 
within the justice system assume that racial 
disparities must be produced through dis- 
crete acts of discrimination. But outright 
or obvious acts of racism are relatively rare 
in Clair’s observations. His relational ap- 
proach, which focuses on the relationship 
between lawyers and defendants, shows 
how inequality works more subtly, through 
ongoing interactions. 

Researchers studying other interac- 
tional settings, such as schools and doc- 
tors’ offices, have found that poor and 
working-class people are disadvantaged 
through patterns of deferential behavior. 
Meanwhile, middle-class actors are able to 
deploy assertive “styles of interaction that 
are either valued by gatekeepers...or that 
force these gatekeepers to provide them 
with more resources than their peers.” But
in interactions with one’s lawyer, something 
like the opposite seems to happen: Poor and 
working-class defendants’ attempts to wield 
knowledge, offer strategies, or otherwise ad- 
vocate on their own behalf frequently back- 
fire, finds Clair. Indeed, defendants’ very 
familiarity with legal protocol, strategy, and 
baroque laws can serve as a strike against 
them, showing that they are not first-time 
offenders. Middle-class defendants, meanwhile,
often take a more passive or deferential
stance, conveying a trusting attitude
toward their lawyer that apparently bodes 
well for the relationship. 

Expertise is thus only valued in the 
“proper” context. Assertive defendants are 
perceived as potentially problematic and 
disrespectful of the process, privileging the 
authority and voice of lawyers and judges. 
Such defendants can gum up the works of 
an otherwise smoothly running system. 

Inexpert attempts at self-advocacy could 
also serve a social bonding function for 
insiders. The development of professional 
in-groups can lead some clients to believe 
that their lawyer is working “hand-in-hand
with the DA and the judge,” as one defendant
put it: “These people, after they leave
[court], they go sit and do lunch.”

A defendant’s lack of deference to the legal system
can tip the scales of justice against him, finds Clair.

Clair’s findings show the weak spot in ex- 
hortations that marginalized people should 
simply adopt the successful behaviors of 
the well-off in such situations. Just as “suc- 
cessful” parenting techniques of a given age 
tend to be whatever the upper class is doing 
at that moment, regardless of broader effi- 
cacy, there is no universally effective man- 
ual for navigating legal institutions. 

In the case of the courtroom, Clair shows
that a defendant’s social capital matters 
more than his or her technical knowledge. 
Several middle-class clients he observed 
were able to build a positive rapport with 
their lawyers through shared interests or 
culture. This is a complex performance 
of concurrent insider and outsider status, 
conveying ease and identification with 
the professional realm of the court and an 
outsider identity relative to the “criminal 
world.” Clair suggests that these distinc- 
tions and cultural performances drive 
material outcomes, from verdicts to sen- 
tencing, as well as the degree to which a 
judgment affects one’s life in the long term, 
thanks to expungement and record sealing. 
(Recall the judge who sentenced Stanford rapist
Brock Turner to just 6 months in jail,
predicting that prison “would have a
severe effect” on Turner.)

Clair details some of the Kafkaesque bar- 
gains facing defendants. Accept the plea 
bargain or maintain your innocence and 
face greater retribution? Aim for probation 
or just take the jail sentence because pro- 
bation is so hard to complete successfully 
without sufficient resources? In detailing 
these decisions, he successfully illuminates 
the distance between defendants’ experi- 
ences and restorative justice. 

The book is worth reading to the end for 
Clair’s take on why this constitutes not just 
inequality but injustice—and what ought 
to be done about this injustice, in the con- 
text of contemporary abolition movements. 
Here, he suggests changes at multiple lev- 
els, ranging from shifting judge discretion 
and rules around lawyer selection to atten- 
uating state involvement in punishment 
by removing laws and creating restorative 
justice processes. The biggest change he 
advocates involves shifting resources out- 
side the criminal justice system altogether 
to meet people’s other needs, rebuilding 
what sociologist Bruce Western, quoted 
here, calls “thick public safety.” 

10.1126/science.abe8289 

Renewables in Spain
threaten biodiversity 


Reducing greenhouse gas emissions is
a global priority. To meet this goal, the
Spanish government is planning 89 GW of 
wind and solar photovoltaic energy in the 
draft of the National Integrated Energy 
and Climate Plan (PNIEC) for 2021-2030 
(1). Despite the Spanish government’s 
efforts to prevent a speculative bubble in 
the secondary market, there are already 
grid access permits for projects represent- 
ing 121 GW that will be added to the 36 
GW of renewables already installed (2), 
almost doubling the goals of the PNIEC. 
The new projects will affect hundreds of 
thousands of hectares, and there is no 
way to offset the huge quantity of valuable 
habitats that could be lost. Spain should 
adopt a more cautious approach to pre- 
vent a scenario in which energy goals are 
met at the expense of biodiversity. 

Many renewables projects are planned 
in low-cost marginal soils of high eco- 
logical value, such as extensive cereal 
farmlands and wild mountain ranges 
harboring steppe birds and raptors whose
largest European or world populations are
in Spain. Photovoltaic energy
needs huge amounts of land and will 
mostly affect declining species of steppe
birds, which are poorly represented in 
the Spanish Natura 2000 network (3). 
Globally threatened large scavengers and 
other unique and scarce soaring birds 
are already paying a heavy toll due to the 
approximately 20,000 existing turbines, 
with demographic consequences for some 
threatened populations (4, 5). Bat-killing 
figures are even higher, with a minimum 
of 200,000 deaths per year according to 
estimated mortality rates (6). 

Studies forecasting mortality have 
shown little predictive power (7), and
when mortality hotspots are detected,
the problematic turbines are virtually never
stopped to reduce bat and bird casualties,
as recommended (8). As a large-scale
approach, the best way to reduce impacts 
is by choosing adequate locations (9). 
However, updated field information 
necessary to achieve this goal is often 
not available, and projects are authorized 
in areas with under-protected species 
because their status within regional and 
state listings is frequently outdated. 

Studies designed to predict and monitor
the impact of renewable infrastructure
are funded by energy companies
(10), often with little supervision by
governments, which precludes independence.
The problem is exacerbated by the
fragmentation of large projects (yielding 
reports of the smaller impacts associated 
with each part rather than a comprehensive
assessment of the full project), the
absence of an in-depth assessment of
cumulative and synergistic environmental
impacts, and decentralized administrative
authority divided among the central state,
regions, and municipalities.

Birds such as this red kite (Milvus milvus) are put
at risk by the proliferation of Spanish wind farms.

We welcome renewables, but we urge 
Spanish authorities to correct these 
deficiencies and implement rigorous com- 
prehensive planning based on the most 
updated ecological knowledge. We also 
call for a stronger commitment to more 
distributed and energy-saving policies 
that would reduce direct environmental 
impacts on biodiversity, such as energy 
efficiency, self-consumption, and improved 
energy performance in buildings. 

Small Aral Sea brings
hope for Lake Balkhash 


In the recent Letter “Save Kazakhstan’s
shrinking Lake Balkhash” (16 October,
p. 303), A. Ussenaliyeva calls for international
attention to the ecological
crisis now facing Lake Balkhash. Similar
problems face other water bodies without
effluents, such as the Great Salt Lake
(1) and Lake Urmia in Iran (2). A parallel
crisis severely damaged the Aral Sea
(2, 3), located between Kazakhstan and 
Uzbekistan. The action taken in response 
could provide a model for lakes suffering 
from desiccation. 

Starting around 1960, diversion of 
inflowing water for irrigation purposes 
caused an accelerating regression and 
salinization of the Aral Sea (3). By 1990, 
the remaining waters sustained only 
organisms able to thrive in polyhaline 
conditions, and commercial fisheries, for- 
merly of prime importance, had vanished 
(4). The desiccation also caused multiple 
severe health problems due to airborne 
dust (5). In 1992, a primitive dam was 
constructed across the Berg Strait to 
retain water in the northern Small Aral 
Sea, demonstrating a cost-effective way 
to maintain a low level of salinity. With 
financial help from the World Bank, the 
structure was later replaced by a stronger 
dam (3, 6), which succeeded in rapidly 
restoring the Small Aral. 

With increasing water volume and 
decreasing salinity, species that had 
become extinct during the regression 
crisis repopulated the sea by natural 
means from refugia in the inflowing Syr
Darya river system (7). Commercial fisher- 
ies that depend on the lake are thriving 
again (7, 8). The Aral Sea crisis foreshad- 
ows what may face Lake Balkhash if no 
timely action is taken. It also shows that
a relatively small financial effort can yield 
substantial results. The Aral Sea is sur- 
rounded by several countries that should, 
jointly with the international community, 
take renewed action to preserve even 
more of this system. This coalition should 
also lend its expertise to determine
how attention and investment can best
prevent a similar ecological crisis at Lake 
Balkhash and the Great Salt Lake. 

Brazil’s areas of not-so-
permanent preservation


Recently, the National Council for the 
Environment in Brazil (CONAMA) repealed 
CONAMA Resolution 303/2002 (1), which 
has, for nearly two decades, set the rules 
for nation-wide Areas of Permanent 
Preservation (APPs). Unlike the Protected 
Areas officially set aside for conservation by 
the government or private property owners 
(2), APPs in Brazil require no governmental 
intervention to remain protected. Those
who favor this repeal argue that CONAMA 
303/2002 and its rules for APPs have been 
superseded by the Forest Act approved in
2012 as Federal Law 12.651 (3, 4). Although 
the Forest Act does cover the protection
of some important ecosystems, such as mangroves
(3), there are crucial environmental
safeguards at risk should the repeal of 
CONAMA 303/2002 go into effect. 

Under the Forest Act alone, coastal veg- 
etation is an APP only if it acts to stabilize 
sand dunes, whereas CONAMA 303/2002 
considers as APP all the coastal vegeta- 
tion within 300 meters of the maximum 
height reached by a rising tide. Conversely, 
sand dunes will lose their protection as 
APPs unless they are stabilized by coastal 
vegetation. In addition, areas used for 
reproduction of migratory birds will no 
longer be classified as APPs. With the repeal 
of CONAMA 303/2002, habitat areas for 
threatened species will retain their categori- 
zation as APPs only if officially declared by 
the government, which rarely occurs. 

After weeks under criticism (5), the repeal 
of CONAMA 303/2002 was temporarily 
suspended until the ultimate decision by 
the Supreme Federal Court in Brazil (6). We 
urge the Court to recognize the importance 
of these environmental safeguards remain- 
ing in place. In times of rapid climate 
change (7), sea level rise (8), and an extinction
crisis (9), the repeal of CONAMA 303/2002 
will only jeopardize coastal management 
efforts and increase threats to species of 
conservation concern. 

The sand dunes of Joaquina Beach in southern Brazil could soon lose their protected status.

Comment on “Forest microclimate dynamics
drive plant responses to warming”

Zellweger et al. (Reports, 15 May 2020, p.
772) claimed that a microclimatic debt, 
mainly controlled by canopy buffering, 
evolved in European forest understories. 
However, their analysis is based on circu- 
larity, as they explained the sum of three 
components by one of these components. 
The response of the understory to the ther- 
mal environment is generally weak. 

Schall and Heinrichs question our interpretation
that the climatic debt in understory
plant communities is locally modulated
by canopy buffering. However, our results
clearly show that the discrepancy between 
microclimate warming rates and thermo- 
philization rates is highest in forests where 
canopy cover was reduced, which suggests 
that the need for communities to respond 
to warming is highest in those forests. 


In their recent paper, Zellweger et al. (1) quantified thermal
macroclimate (MaC), thermal canopy buffering (CB), and
community-based floristic temperature of understory plants
(FT) for two points in time, a baseline survey and a recent
resurvey, for plots located in 56 European regions. Decadal
change rates of macroclimate (ΔMaC), canopy buffering
(ΔCB), and floristic temperature (ΔFT, called thermophilization
rate) were used to adjust for varying survey intervals.
Microclimate change rate (ΔMiC) was defined as the sum of
macroclimate change rate and canopy buffering change rate
(ΔMiC = ΔMaC + ΔCB). The authors found a significant relationship
between ΔMiC and ΔFT, whereas ΔMaC was not
related to ΔFT. Reanalyzing this relationship using provided
data (2) revealed a low effect size of 0.051° ± 0.012°C °C⁻¹
(mean ± SEM) and a large scatter (R² = 1.1%; Fig. 1A). Thus,
contrary to the authors’ claim “that microclimate change
ultimately drives organismal responses to climate change,”
microclimate change seems not to be the main driver of
thermophilization in forest understories.

The authors continued by investigating how climate
change affects a climatic debt. Climatic debt (ΔCD) was calculated
by subtracting ΔFT from microclimate change rate
(ΔCD_mic = ΔMaC + ΔCB − ΔFT) and macroclimate change
rate (ΔCD_mac = ΔMaC − ΔFT) for each plot. Because ΔFT only
weakly followed the climate change (Fig. 1A), a climatic debt
automatically appeared. The authors argue that the understory
lagged behind climate warming, leading to the accumulation
of this climatic debt, but the validity of this
assumption was not proven. Although a reanalysis of the
baseline survey data (using floristic temperature data kindly
provided by Zellweger) showed a stronger relationship between
floristic temperature and climate than was found between
rates, the slope was far from the assumed 1:1
relationship (Fig. 1B).
Thus, even for the baseline, floristic temperature was
not strongly controlled by macroclimate and even less by 
microclimate; this does not correspond to the assumed di- 
rect linkage proposed by the climatic debt concept. These 
relationships have also not significantly changed for the 
resurvey, indicating that the understory community is stable 
in its thermal traits across space and time (Fig. 1, B and C).
This also means that colder and warmer (closed and open)
forests had nearly identical floristic temperatures.
Rather than lagging behind temperature changes, forest 
plant communities are possibly well adapted to the periodic 
changes of canopy cover in forests (3) and often benefit 
from increased light availability after canopy opening (4). 
Therefore, climatic debt seems to be a construct not sup- 
ported by the floristic data. 

Zellweger et al. used this construct and related it to 
canopy buffering. The authors achieved an explained vari- 
ance for microclimatic debt of 28.9%, suggesting that warm- 
ing by reduced canopy buffering drives microclimatic debt 
and concluding that “increasing climatic debts in communi- 
ty responses to climate change mean that a growing number 
of species are occurring in suboptimal climate conditions, 
potentially accelerating the loss of biodiversity” and “that 
climate change impacts on forest plant communities have 
been reduced by higher standing stocks and associated cool- 
ing after increases in thermal buffering.” These conclusions 
are, however, based on circular reasoning, as Zellweger et al. 
explained microclimatic and macroclimatic debt by a vari- 
ant of the canopy buffering change rate, namely the canopy 
buffering absolute change (CB_abs = CB_R − CB_B), which is
highly correlated with ΔCB (Pearson’s r = 0.93), using linear
mixed-effect models with region as a random effect. In do- 
ing so, Zellweger et al. explained a sum with a component of 
the sum, leading to dependency between explanatory and 
response variables. The authors basically analyzed the effect 
of canopy buffering on climatic debt by modeling (ΔMaC +
ΔCB − ΔFT) = f(ΔCB) for microclimatic debt and (ΔMaC −
ΔFT) = f(ΔCB) for macroclimatic debt. Predictably, microclimatic
debt but not macroclimatic debt was driven by canopy
buffering, but this is only because canopy buffering
explains itself in the microclimate case. We used hierarchical
partitioning (5) to disentangle the contributions of
the components of microclimatic debt (i.e., ΔMaC, ΔCB,
ΔFT) to microclimatic debt (Table 1) and to explicitly show
what the authors did. Microclimatic debt is driven, in
descending order, by ΔMaC (45.9%), ΔCB (33.7%; ΔMiC = ΔMaC + ΔCB:
80.2%), and ΔFT (15.9%), and is 100% explained by
its three components together. Because of this circularity, the relationship
between microclimatic debt and canopy buffering presented
by Zellweger et al. does not provide any evidence for
a response of the understory community to climate change. 
Thus, the conclusion of the authors that “a reduction of 
canopy cover impedes the ability of understory plant com- 
munities to respond to such high rates of warming” is not 
proven by the data. 
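
The circularity described above can be illustrated with a minimal simulation sketch. It is not part of the original analysis and does not use the survey data: it assumes three synthetic change rates (macroclimate, canopy buffering, floristic temperature) that are mutually independent with unit variance, and the names `simulate` and `pearson_r` are hypothetical helpers. Under these assumptions, the covariance between microclimatic debt and the canopy buffering change rate equals the variance of the canopy buffering change rate itself, so canopy buffering should explain about one-third of the variance of microclimatic debt and almost none of macroclimatic debt, even though the simulated understory does not respond to climate at all.

```python
import random


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def simulate(n_plots=5000, seed=0):
    """Return (R2 of microclimatic debt, R2 of macroclimatic debt) on canopy buffering.

    All three change rates are drawn independently (an assumption made
    only for illustration), so no variable truly responds to another.
    """
    rng = random.Random(seed)
    d_mac = [rng.gauss(0.0, 1.0) for _ in range(n_plots)]  # macroclimate change rate
    d_cb = [rng.gauss(0.0, 1.0) for _ in range(n_plots)]   # canopy buffering change rate
    d_ft = [rng.gauss(0.0, 1.0) for _ in range(n_plots)]   # thermophilization rate

    # Debts defined as in the text: climate change rate minus thermophilization rate.
    debt_mic = [m + c - f for m, c, f in zip(d_mac, d_cb, d_ft)]
    debt_mac = [m - f for m, f in zip(d_mac, d_ft)]

    # For a single-predictor linear model, R2 is the squared Pearson correlation.
    r2_mic = pearson_r(d_cb, debt_mic) ** 2
    r2_mac = pearson_r(d_cb, debt_mac) ** 2
    return r2_mic, r2_mac


if __name__ == "__main__":
    r2_mic, r2_mac = simulate()
    print(f"R2, microclimatic debt ~ canopy buffering: {r2_mic:.3f}")
    print(f"R2, macroclimatic debt ~ canopy buffering: {r2_mac:.3f}")
```

The sketch uses ordinary correlation rather than the mixed-effect models or hierarchical partitioning discussed in the text, and the equal variances are a simplification; with unequal variances the explained share changes, but the dependency between response and predictor remains.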

We agree that forest understories experience climate 
change, but the floristic temperature seems to be relatively 
stable, at least until now (Fig. 1). There is no evidence that 
the relationship between the thermal affinity of understory 
plant communities and canopy closure or openness has 
changed markedly between the baseline survey and the re- 
survey. This is also supported by the weak relationship be- 
tween thermophilization and climate change reported by 
Zellweger et al. Thus, there is no reason to suggest that canopy
openings will create adverse thermal conditions for
many species, in turn influencing forest management deci- 
sions. 

Long-term data are highly valuable in detecting the ef- 
fect of environmental change on ecosystems. Since the 2009 
publication by Keith et al. (6), evidence for a homogeniza- 
tion and diversity loss of European forest understories has 
been increasing, with shade-tolerant, nutrient-demanding 
generalists expanding while light-demanding indicators of 
nutrient-poor sites disappear (7-10). Recently, Staude et al.
(11) verified this using data from the same database as Zell- 
weger et al. (1). Thus, when canopy openings (either natural 
or due to forest management) coexist with closed forests, 
they can rather be seen as a chance to maintain heterogene- 
ity in forest understories (12) and to conserve plant species 
that benefited from traditional forest management practices 
centuries ago (13, 14). By reducing the variation of canopy 
closure, we run the risk of accelerating the biodiversity loss 
in forest understories. 
Table 1. Independent and joint contribution of macroclimate change rate, canopy buffering change rate, and
thermophilization rate to microclimatic debt from hierarchical partitioning. The total contribution reflects the
explained variance R² of single linear models. A partitioning with microclimate change rate (ΔMiC) instead of its
components (ΔMaC and ΔCB) contributes 80.2% to microclimatic debt. Considering region as a random effect in
mixed models affects results only marginally (marginal R²: ΔMiC, 79.1%; ΔMaC, 45.7%; ΔCB, 33.6%; ΔFT, 13.5%).


Component                             Independent   Joint contribution   Total
Macroclimate change rate (ΔMaC)       44.9%         1.0%                 45.9%
Canopy buffering change rate (ΔCB)    37.3%         -3.6%                33.7%
Thermophilization rate (ΔFT)          17.8%         -1.9%                15.9%
Sum                                   100.0%        -4.5%                95.5%
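
The arithmetic of Table 1 can be checked directly: in hierarchical partitioning, each total contribution should equal the independent plus the joint contribution. The short sketch below only re-adds the percentages printed in the table; the dictionary keys and the function name are hypothetical labels chosen for illustration.

```python
# Percentages from Table 1: (independent, joint, total) per component.
TABLE_1 = {
    "dMaC": (44.9, 1.0, 45.9),   # macroclimate change rate
    "dCB": (37.3, -3.6, 33.7),   # canopy buffering change rate
    "dFT": (17.8, -1.9, 15.9),   # thermophilization rate
}


def check_table(rows, tol=0.05):
    """Verify independent + joint = total per row and return column sums."""
    for name, (independent, joint, total) in rows.items():
        if abs(independent + joint - total) > tol:
            raise ValueError(f"row {name} is inconsistent")
    # Column sums, rounded to the one-decimal precision used in the table.
    return tuple(
        round(sum(row[i] for row in rows.values()), 1) for i in range(3)
    )


column_sums = check_table(TABLE_1)
print(column_sums)
```

The independent contributions add up to the full variance of microclimatic debt, which is what one expects when a quantity is explained by its own components.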
Fig. 1. Microclimate is only weakly related to floristic temperature and its change rate. (A) Mean effect size
(±SEM) of microclimate change on thermophilization, based on linear mixed-effect models with region as a random
effect, is 0.051° ± 0.012°C °C⁻¹ decade⁻¹. It contributes only slightly to the explained variance of thermophilization
as indicated by a marginal (conditional) R² value of 1.1% (11.8%). Mean effect size (±SEM) of macroclimate change
on thermophilization is -0.010° ± 0.026°C °C⁻¹ decade⁻¹ [R²: 0.0% (10.1%)]. The scatter shown is adjusted for the
random effect. Black dashed lines indicate the 1:1 relationship and the zero line. (B and C) Mean effect size (±SEM)
of maximum temperature on floristic temperature, based on linear mixed-effect models with region as a random
effect, was about double for macroclimate relative to microclimate for the baseline survey (0.137° ± 0.017°C °C⁻¹
versus 0.052° ± 0.013°C °C⁻¹; R²: 11.1% versus 1.8%) (B) as well as for the resurvey (0.112° ± 0.018°C °C⁻¹ versus
0.069° ± 0.013°C °C⁻¹; R²: 6.6% versus 2.8%) (C). The scatter shown is adjusted for the random effect. Vertical
dashed lines show the means of macro- and microclimate. The 1:1 relationship is indicated by the black dashed line.
Slopes comparing the baseline and the resurvey are not significantly different.
Schall and Heinrichs question our interpretation that the climatic debt in understory plant communities is
locally modulated by canopy buffering. However, our results clearly show that the discrepancy between 
microclimate warming rates and thermophilization rates is highest in forests where canopy cover was 
reduced, which suggests that the need for communities to respond to warming is highest in those forests. 


Biological communities accumulate a climatic (thermal) 
debt when their response to warming does not keep up with 
the warming rate itself. Forest understory plant communi- 
ties appear to respond particularly slowly to warming, and 
thus climatic debts are commonly observed in forest understory
plant communities (1, 2). In line with conventional
approaches used in the cited literature [e.g., (1, 2)], we define
the climatic debt as the difference between the thermophilization
rate and the rate of climate warming, which
are two independently calculated variables. 

Schall and Heinrichs (3) question the validity of our cli- 
matic debt analysis because there is no 1:1 relationship be- 
tween the floristic temperatures and the macro- and 
microclimate temperatures at one point in time. In this discussion,
it is important to keep in mind that the commonly
applied approach to inferring temperature conditions from
plant species composition data (i.e., floristic temperatures) (4,
5) is not designed to accurately reflect the actual temperatures
at a particular time step across space, because these
data are based on broad distribution ranges of individual 
species co-occurring in the community and are subject to 
considerable uncertainty, as outlined in detail by Rodriguez- 
Sanchez et al. (6). Thus, caution is needed when using such 
floristic temperatures to study spatial variation of community 
temperature preferences along a climatic gradient, as done by 
Schall and Heinrichs, especially when the studied macroclimatic
gradient is relatively short, as is the case in our study
(the interquartile ranges of spatial macroclimate tempera- 
tures in the baseline and resurveys were only 2.4° and 2.2°C, 
respectively). However, in our paper we studied temporal, not 
spatial, changes in floristic temperatures per unit time, and 
for this purpose floristic temperatures have been shown to be 
unbiased and robust, providing a solid database for our 
thermophilization and climatic debt analyses (6). Further- 
more, our floristic temperatures per species are based on 
long-term macroclimate data during the vegetation period, 
whereas the macro- and microclimate change data were cal- 
culated independently on the basis of summer maximum 
temperatures prevailing in the 5 preceding years of the re- 
spective field surveys. Schall and Heinrichs’ finding that the 
floristic temperatures at one time step more closely follow the 
macroclimate than the microclimate temperatures is also 
very plausible and expected, exactly because the floristic tem- 
peratures were calculated from macroclimate data, not mi- 
croclimate data, which are not yet available at such scales. 
The patterns shown in Schall and Heinrichs’ figure 1, B and C, 
can thus be explained very well and provide no reason to 
question the validity of our climate debt assessment. We also 
note that the mean thermophilization rates we found in our 
study compare well to the rates found in other lowland forests
in Europe (1), as well as in forests in the Andes (2), and
that the variation of our floristic temperatures for a given 
unit of spatial macroclimate change is similar to the respec- 
tive variation found in these two studies. Schall and Hein- 
richs’ claim that the climatic debt seems to be a construct not 
supported by the floristic data does not hold. 

Schall and Heinrichs further question our interpreta- 
tion of the effect of canopy buffering on climatic debt. As 
outlined above, and in agreement with Schall and Heinrichs’ 
concerns, it is obvious that microclimate warming and mi- 
croclimatic debt are not independent from each other. In 
fact, given the frequently observed slow response of forest 
plant communities (1, 2), it can be expected that microclimate
debts are more pronounced in areas with high rates of
warming. We show that changes in temperature buffering 
due to dynamics in canopy cover are an important and inte- 
gral component of forest microclimate warming. Regulating 
effects of canopy opening (e.g., due to clearcutting or tree 
mortality) on forest-floor temperature can act independent- 
ly from macroclimate warming (7). It is thus reasonable to 
expect that changes in canopy-modulated temperature buffering
are related to the microclimate debt, but not to macroclimate
debt. What we show in figure 3A of (1) is the
contribution of the change in temperature buffering to the 
microclimatic debt. We argue that this is an interesting con- 
tribution because (i) it can be substantial but has been 
largely ignored in the literature, and (ii) it shows how the 
need for communities to respond to warming is locally 
modulated by canopy cover dynamics. We agree with Schall
and Heinrichs’ claim that the relationship between microclimate
debt and canopy buffering does not provide evidence
of a response of the understory community to climate
change. Evidence of such a response is provided in figure 2
of (1). However, the said relationship shows how the need
for communities to respond to warming is locally modulated 
by canopy buffering, despite the non-independence between 
microclimate debt and microclimate warming. An ecologi- 
cally realistic assessment of how much communities lag be- 
hind warming requires data about microclimate warming. 
In contrast to Schall and Heinrichs’ critique and given our 
finding that thermophilization is more related to microclimate
warming than to macroclimate warming [figure 2 of
(1)], we indeed argued that microclimate warming, and not
macroclimate warming, ultimately drives organismal re- 
sponses to warming. These results also fully support our 
conclusion that a reduction in canopy buffering leads to 
higher rates of warming, thus seriously increasing the pres- 
sure for plant communities to respond to warming. 

We agree with Schall and Heinrichs’ conclusion that 
understory plant communities have responded only weakly 
to warming temperatures. That is actually the crucial point, 
because at the same time the temperatures themselves have 
warmed markedly, causing a climatic debt. It may indeed be 
that the microclimate temperature variations observed with- 
in our studied forests were within the thermal tolerances of 
most species, but given their slow response, the currently 
observed climate warming rates continue to erode their 
thermal safety margins, especially at their equatorward 
range boundaries. This has serious implications for future 
forest diversity and function (8) and will likely be felt first 
and most strongly in forests subject to relatively high local 
warming rates due to a reduction in canopy buffering. 

Finally, we fully and evidently agree that deep shade is 
not beneficial for many species, and we have never claimed 
otherwise. Instead, we make the point that it is important to 
consider the effects of different forest management practic- 
es on local microclimates in any endeavor to safeguard for- 
est biodiversity in a warming world. Forest ecologists are 
very much aware that many species benefit from canopy 
openings, and many of us have extensively worked on this
topic (9-11). We also fully agree that at a landscape scale, 
biodiversity is enhanced by forest patches with deep shade 
and more open patches or edge habitat, as shown by many 
studies, including our own work (12, 13). Environmental
change affects forest understory plant communities in many 
ways, and we agree with the general remarks in Schall and 
Heinrichs’ concluding paragraphs. However, in terms of 
recent warming effects on forest biodiversity, we argue that 
microclimate warming and its local drivers deserve in- 
creased attention. 
Season of the drought
The Indian monsoon is a critical source of water for hundreds of millions of people, and when
it fails to deliver its normal quantity of rain, enormous human, economic, and ecological
costs can be incurred. Monsoon droughts are not always seasonal, however. Borah et al.
found that nearly half of all monsoonal droughts were subseasonal and characterized by a
steep decline in late-season rainfall. Moreover, this type of subseasonal drought appears
to be related to a distinct cold anomaly in the North Atlantic Ocean, raising the possibility that
monsoon droughts may be more predictable. —HJS Science, this issue p. 1335
A woman carries water across a dry lakebed in India, where a late-season decline in the monsoon can lead to drought.


Efficiency from hole- 
selective contacts 


Perovskite/silicon tandem solar 
cells must stabilize a perovskite 
material with a wide bandgap 
and also maintain efficient 
charge carrier transport. 
Al-Ashouri et al. stabilized a 
perovskite with a 1.68-electron
volt bandgap with a self-assembled
monolayer that acted as an
efficient hole-selective contact 
that minimizes nonradiative 
carrier recombination. In air 
without encapsulation, a tan- 
dem silicon cell retained 95% 
of its initial power conversion 
efficiency of 29% after 300 
hours of operation. —PDS 
Science, this issue p. 1300 
Stem-like T cells mediate
response


Adoptive cell transfer (ACT) is 
a type of immunotherapy that 
uses a patient’s own T lympho- 
cytes to recognize and attack 
cancer. ACT has been effective 
in treating certain patients with 
metastatic melanoma and is 
being applied to treat some
epithelial cancers. Krishna 
et al. investigated why some 
cancer patients respond to ACT, 
whereas others do not. They 
identified a population of CD8⁺ T
cells that had stem-like surface 
markers that were associated 
with effective tumor cell killing 
and favorable response of 
melanoma patients to ACT. Only 
a small subset of T cells specific 
against tumor mutations were 
found in this stem-like state, 
whereas most mutation- 
reactive T cells were terminally 
differentiated. These findings 
could be of value in improving 
cancer immunotherapy out- 
comes. —PNK 

Science, this issue p. 1328 


Corals catch fire 
Pyroptosis is a form of inflammatory
cell death that occurs in
response to pathogen infection 
and results in the release of 
intracellular contents mediated 
by the pore-forming gasder- 
min family proteins. Jiang 
et al. identified a conserved 
gasdermin E homolog in corals 
that is cleaved by both coral 
and human caspase 3 to form 
two active N-terminal isoforms, 
each capable of inducing 
pyroptosis. After pathogen 
infection, caspase-dependent 
gasdermin E activation was 
associated with mitochondrial 
disruption and necrosis in 
the reef-building coral spe- 
cies Pocillopora damicornis. 
Gasdermin-mediated cell 
death is likely conserved in 
some invertebrates and may 
represent an immune defense 
activated in corals during bac- 
terial infection resulting from 
environmental stress. —CO 

Sci. Immunol. 5, eabd2591 (2020). 


A decline in the carbon 
fertilization effect 


One source of uncertainty in 
climate science is how the 
carbon fertilization effect (CFE) 
will contribute to mitigation
of anthropogenic climate
change. Wang et al. explored
the temporal dynamics of CFE 
on vegetation photosynthesis at 
the global scale. There has been 
a decline over recent decades in 
the contribution of CFE to vege- 
tation photosynthesis, perhaps 
owing to the limiting effects of 
plant nutrients such as nitrogen 
and phosphorus. This declining 
trend has not been adequately 
accounted for in carbon cycle 
models. CFE thus has limita- 
tions for long-term mitigation 
of climate change, and future 
warming might currently be 
underestimated. —AMS 

Science, this issue p. 1295


Subcellular map of 
vesicle maturation 


Insulin-containing vesicles in 
pancreatic β cells must migrate
from the cell interior to the cell 
surface after stimulation by 
glucose. Two studies now use 
related whole-cell imaging tech- 
niques, soft x-ray tomography, 
and cryo-electron tomography 
to resolve the distribution,
size, density, and location of
insulin-containing vesicles as
a function of time. White et al.
visualized a mesoscale map
of whole cells, and Zhang et
al. provided higher-resolution
structural information in 
specific ‘neighborhoods’ of the 
cell. Understanding insulin regulation
of circulating glucose by
β cells will be advanced by this
global picture of intracellular
dynamics along the insulin
secretion pathway. —PLY
Sci. Adv. 10.1126/sciadv.abc8262,
10.1126/sciadv.abc8258 (2020).


A colored soft x-ray tomogram shows
insulin vesicles (yellow spheres)
distributed within a pancreatic β cell


Molecular qubits that 
respond to light 


Spins in solid-state systems such 
as quantum dots and defect 
centers in diamond can easily 
be controlled by light for use in 
quantum information process- 
ing. More challenging is tuning 
their properties and making 
large arrays, something that 
can be done more easily with 
spins in molecules. Bayliss et al. 
combined the advantages of the 
two approaches by designing 
and characterizing three related 
molecular species that are opti- 
cally addressable. The molecules 
consist of a central chromium ion 
surrounded by organic ligands, 
and their spin and optical proper- 
ties can be tailored by simply 
changing the positions of methyl 
groups on the ligands. —JS 
Science, this issue p. 1309 


Electric field shielding of 
ultracold molecules 


Because reactive collisions 
limit the lifetime of ultracold 
molecular ensembles, con- 
trolling chemical reactivity at 
ultralow temperatures has been 
a long-standing goal. Using 
large electric fields that trigger 
resonant dipolar interactions 
between potassium-rubidium 
molecules trapped in a quasi- 
two-dimensional geometry, 
Matsuda et al. report suppres- 
sion of the reactive loss rate 
in the vicinity of the dipolar- 
mediated resonances by up to 
an order of magnitude below 
the background value. The 
proposed shielding mechanism 
is general and is expected to be 
effective in three-dimensional 
geometry. It could also be used 
for creating long-lived quantum 
molecular gases of other polar 
molecules under strong electric 
fields. —YS 

Science, this issue p. 1324 





RADIATIVE COOLING 


Edited by Caroline Ash 
and Jesse Smith 


A smarter radiative cooler 


Passive daytime radiative cooling materials allow heat
to be transported into outer space through Earth's
atmospheric infrared transparency window. Xue et al.
focus on engineering inexpensive coating materials for
buildings, which take advantage of daytime radiative
cooling but suppress it in the night when cooling no longer 
is needed. This is accomplished by widening the emissivity 
spectrum over the entire mid-infrared range. The strategy 
also uses common materials for the coatings that are more 
likely to withstand long-term weathering. —BG 


Adv. Mater. 32, e1906751 (2020).


False-colored scanning electron microscope image of a coating for
cooling buildings, containing hollow glass microspheres (red) and
fluorescent pigment (green)


Alleviating side effects 
Platinum-based chemotherapy 
drugs, such as cisplatin and 
oxaliplatin, are commonly used 
to treat diverse cancer types. 
However, their use is limited 

by side effects, particularly 
vomiting, anorexia, muscle 
wasting, and weight loss. Breen 
et al. show that the amounts 

of the cytokine growth dif- 
ferentiation factor 15 (GDF15) 
increase in the circulation of
patients with colorectal cancer, 
non-small cell lung cancer
(NSCLC), and ovarian cancer 
who were treated with platinum 
chemotherapy. Moreover, the 
amount of circulating GDF15 
was correlated with weight loss 
in metastatic colorectal cancer 
patients receiving oxaliplatin. 
Neutralization of GDF15 using 
monoclonal antibodies in 
nonhuman primates treated 
with cisplatin attenuated vomiting
and anorexia. Neutralizing GDF15 also
reversed weight loss in mice with
NSCLC treated with cisplatin. 
This work shows that GDF15 has
a causal role in these side effects
and could be targeted to enable
optimal chemotherapy treatment.
—GKA

Cell Metab. 32, 938 (2020).


Health care and conservation

Logging in tropical forests affects the global climate, and there is an urgent need to reduce
the rate of loss. Jones et al. show how access to health care for an indigenous community
can contribute to this goal. Over a 10-year study in rural Borneo, they found that logging
was reduced by 70% in forests adjacent to communities for whom a local health care clinic
was established. The health outcomes were positive, too, with reductions in the incidence
of tuberculosis, malaria, and other tropical diseases. This pattern of positive outcomes holds
promise for human health and forest conservation in tropical forest areas that are home to
impoverished communities. —AMS Proc. Natl. Acad. Sci. U.S.A. 117, 28515 (2020).

To reduce deforestation, rural communities in Borneo are exchanging chainsaws for affordable health care.


HEMATOPOIESIS 
Fetal versus adult 
hematopoiesis 


Development of an organism 
over time requires a multitude of 
factors, cells, and movements. 
The fetal-to-adult transition is 

a time of major transformation. 
The hematopoietic system is one 
development process that sees 
important changes to enable 
differential cell composition and 
activity. Li et al. used single-cell 
RNA-seq, ATAC-seq, and ChIP-seq
in mice and observed a gradual, 
rather than an abrupt, shift in 
gene expression and the epigene- 
tic mechanism for hematopoietic 
stem cells and hematopoietic
progenitor cells. Indeed, neonatal 
and adult expression can occur at 
the same time and the changes 
seem to occur in an uncoor- 
dinated manner rather than 
following a strict gene regulatory 
network. Just before birth, type I
interferon is activated and plays 
a specific role at the perinatal to 
postnatal hematopoietic system 
switch. —BAP 


Cell Stem Cell 27, 732 (2020). 


TOPOLOGICAL OPTICS 
Confining light—bound 
and protected 


The traditional method of confin- 
ing light does so within the cavity 
formed by two mirrored surfaces. 
Cerjan et al. explored another 
form of light confinement using 
the higher-order topological 
insulating states of crystals. 
They show that these features
provide protection against 
defects and chiral light propaga- 
tion. Controlling the symmetry 
of artificial crystal structures to 
produce topologically protected 
bound states could prove a flex- 
ible route to fabricate devices 
from materials where the usual 
methods to confine and manipu- 
late light fail. —ISO 


Phys. Rev. Lett. 125, 213901 (2020). 


HIV 


Wheeling out HIV 
elimination 


Several sub-Saharan countries 
have been prioritized by the 
Joint United Nations Programme 
on HIV/AIDS (UNAIDS) for HIV 
elimination. This will require 
90% treatment coverage by 

the end of 2030, but poor 
transport infrastructure means 
incomplete access to medication
in some countries. Palk et
al. developed a geospatial model 
for understanding the difficulties 
of reaching health care facilities 
in Malawi, a country severely 
affected by HIV. They mapped 
health care facilities with the 
density of HIV prevalence and 
quantified the difficulty of travel 
across Malawi's landscape by 
a friction surface raster map. 
If bicycles are used, then the 
catchment size for a health care 
facility is substantially larger 
than if people walk, and the 
required 90% treatment cover- 
age for elimination becomes 
achievable. Bicycles are already 
used as “ambulances” in rural 
areas, but Malawi's bicycle fleet 
is small and in poor repair. One 
straightforward route for beating 
HIV (and many other health 
conditions) here and in similar 
countries could lie simply in 
boosting the supply of bicycles, 
an established and well-tested 
technology. —CA 

Lancet Glob. Health 8, E1555 (2020). 


MEDICAL SCREENING 
Who is getting 
mammograms? 


Whether and when to recommend
medical screening
can be contentious. Einav et al.
suggest that too little is known
about how women who only
seek mammograms in response
to recommendations at age
40 (“compliers”) might differ
from women who don't wait for
recommendations (“always-
takers”) or who never get
mammograms (“never-takers”).
They estimate that always-
takers have higher rates of in
situ and invasive cancer than
the general population, whereas 
compliers are less likely to have 
invasive cancer and no more 
likely to have in situ cancer than 
never-takers. Screening debates 
must consider such selection 
effects because the typical 
woman who responds to the 
recommendation appears to be 
distinct from the broader popu- 
lation of women covered by the 
recommendation. —BW 

Amer. Econ. Rev. 110, 3836 (2020). 
DISEASE DYNAMICS
Emergence to endemism 


The emergence of a devastat- 
ing transmissible facial cancer 
among Tasmanian devils over 
the past few decades has caused 
substantial concern for their 
future because these animals 
are already threatened by a 
regional distribution and other 
stressors. Little is known about 
the overall history and trajec- 
tory of this disease. Patton et 
al. used an epidemiological 
phylodynamic approach to 
reveal the pattern of disease 
emergence and spread. They 
found that low Tasmanian devil 
densities appear to be contribut- 
ing to slower disease growth and 
spread, which is good news for 
Tasmanian devil persistence and 
suggests that care should be 
taken when considering options 
for increasing devil populations. 
—SNV 

Science, this issue p. 1293 


SYSTEMS BIOLOGY 
From phenotype 
to structure 


Much insight has come from 
structures of macromolecular 
complexes determined by meth- 
ods such as crystallography 

or cryo-electron microscopy.
However, looking at transient 
complexes remains challenging, 
as does determining structures 
in the context of the cellular 
environment. Braberg et al. used
an integrative approach in which 
they mapped the phenotypic 
profiles of a comprehensive set
of mutants in a protein complex 
in the context of gene deletions 
or environmental perturbations 
(see the Perspective by Wang). 
By associating the similarity 
between phenotypic profiles with 
the distance between residues, 
they determined structures 

for the yeast histone H3-H4 
complex, subunits Rpb1-Rpb2

of yeast RNA polymerase II, and 
subunits RpoB-RpoC of bacterial 
RNA polymerase. Comparison 
with known structures shows
that the accuracy is comparable
to structures determined based 
on chemical cross-links. —VV 
Science, this issue p. 1294; 
see also p. 1269 


BIOPHYSICS 
Rheology of aging protein 
condensates 


Protein condensates that form 
by undergoing liquid-liquid 
phase separation will show 
changes in their rheological 
properties with time, a process 
known as aging. Jawerth et 
al. used laser tweezer-based
active and microbead-based 
passive rheology to characterize 
the time-dependent material 
properties of protein conden- 
sates (see the Perspective 
by Zhang). They found that 
condensate aging is not gela- 
tion of the condensates, but 
rather a changing viscoelastic 
Maxwell liquid with a viscosity 
that strongly increases with age, 
whereas the elastic modulus 
stays the same. —MSL 

Science, this issue p. 1317; 

see also p. 1271 


BATTERIES 
Cracking the problem of 
cracking cathodes 


Polycrystalline cathode materi- 
als that contain a combination 
of nickel, manganese, and 
cobalt have been used for 
advanced lithium batteries. 
These materials fracture at 

high voltage, which increases 
surface area and leads to more 
side reactions and shorter cycle 
life. Using single-crystalline 
samples as model materials, 

Bi et al. observed changes in 
nickel-rich cathodes to study 
the fracture behavior under well- 
characterized conditions. As the 
material is charged and lithium 
is removed, specific planes glide 
over one another and micro- 
cracks are observed. However, 
this process is reversed on 
discharge, removing all traces of
the microcracking. The authors
developed a diffusion-induced 
stress model to understand the 
origin of the planar gliding and 
propose ways to stabilize these 
nickel-rich cathodes in working 
batteries. —MSL 
Science, this issue p. 1313 


PALEOCLIMATE 
Controlling atmospheric 
carbon dioxide 


The atmospheric concentration
of carbon dioxide (CO₂) has
varied substantially over the
past million years in tandem with
the glacial cycle. Although it is
widely agreed that upwelling of
Southern Ocean water is a key
factor, the finer details about
what caused these CO₂ variations
are of great importance for
understanding climate. Ai et al.
identified three modes of change
in Southern Ocean upwelling,
adding a third to two previously
recognized ones. This new mode
can help better explain the relative
timing of the glacial and CO₂
cycles. —HJS

Science, this issue p. 1348 


EVOLUTION 
Diversity does not 
drive speciation 


The role of the environment in 
the origin of new species has 
long been debated. Harvey et al. 
examined the evolutionary history 
and species diversity of subos- 
cine birds in the tropics (see the 
Perspective by Morlon). Contrary 
to expectations that the tropics 
have higher rates of speciation, 
the authors observed that higher 
and more constant speciation 
rates occur in harsh environments 
relative to the tropics. Thus, for 
this group of birds, diversification 
in temperate to Arctic regions 
followed by the movement and 
retention of species in the tropics 
results in their higher local levels 
of species diversity. —LMZ 
Science, this issue p. 1343; 
see also p. 1268 
CORONAVIRUS
Antibodies predating 
infection 


Immunological memory after 
infection with seasonal human 
coronaviruses (hCoVs) may 
potentially contribute to cross- 
protection against severe acute 
respiratory syndrome coro- 
navirus 2 (SARS-CoV-2). Ng 
et al. report that in a cohort of 
350 SARS-CoV-2-uninfected 
individuals, a small proportion 
had circulating immunoglobulin 
G (IgG) antibodies that could 
cross-react with the S2 subunit 
of the SARS-CoV-2 spike 
protein (see the Perspective 
by Guthmiller and Wilson). By 
contrast, COVID-19 patients 
generated IgA, IgG, and IgM anti- 
bodies that recognized both the 
S1 and S2 subunits. The anti-S2
antibodies from SARS-CoV-2- 
uninfected patients showed 
specific neutralizing activity 
against both SARS-CoV-2 and 
SARS-CoV-2 S pseudotypes. 
A much higher percentage 
of SARS-CoV-2-uninfected 
children and adolescents were 
positive for these antibodies 
compared with adults. This pat- 
tern may be due to the fact that 
children and adolescents gener- 
ally have higher hCoV infection 
rates and a more diverse 
antibody repertoire, which may 
explain the age distribution of 
COVID-19 susceptibility. —STS 
Science, this issue p. 1339; 
see also p. 1272 


IMMUNOLOGY 


Small NOD to big changes 
The innate pathogen-sensing 
protein NOD1 mediates 
proinflammatory responses 

to molecular signals from 
pathogens, but certain genetic 
variants in the NOD1 gene are
associated with an increased 
risk for developing gastric 
cancer. Rommereim et al. found 
that small increases in NOD1 
abundance resulted in dispro- 
portionately large increases in 
the expression of inflammatory 
genes and oncogenes. A 1.2-
to 1.3-fold increase in NOD1 
abundance reduced the amount 
of ligand required to activate 
NOD1 in a monocytic cell line.
NOD1-mediated transcriptional 
responses became ligand 
independent upon a 1.5-fold 
increase in NOD1 abundance. 
—WW 

Sci. Signal. 13, eaba3244 (2020). 


INFLUENZA 

Influencing influenza 
immunity 

Humans are exposed to influ- 
enza virus throughout their 
lifetimes through a combination 
of infections and vaccinations. 
It remains unclear whether dif- 
ferent exposure routes induce 
distinct influenza-specific 
immunological memory. Dugan 
et al. found that infection- 
induced antibodies reacted 

to non-neutralizing epitopes 

of influenza virus, whereas 
vaccination-induced antibodies 
reacted to neutralizing epitopes. 
Infection-induced antibodies 
also preferentially responded 

to influenza strains present 
during an individual's childhood. 
Passive transfer of vaccination- 
induced antibodies, but not 
infection-induced antibodies, 
protected mice in a model of 
influenza infection. These find- 
ings demonstrate that existing 
influenza-specific memory and 
route of exposure influence 
influenza immunity. —CM 

Sci. Transl. Med. 12, eabd3601 (2020). 


CORONAVIRUS 
A history of vaccine safety 


Vaccines are safe and effective 
and save millions of lives each 
year. Vaccine development has 
led to rigorous safety protocols 
to ensure that lessons from 
history are not repeated. In a 
Perspective, Knipe et al. discuss 
the history of vaccine safety 
issues that have led to cur- 
rent regulations and protocols, 
including pausing trials when 
any adverse event arises and 
ensuring safety monitoring 
after regulatory approval. The 
need for a COVID-19 vaccine 
and possible early regulatory
approval before the completion 
of phase 3 trials could risk our 
ability to collect comprehensive 
safety data. It is important that 
safety, as well as efficacy, is
evaluated before deployment. 
—GKA 

Science, this issue p. 1274 


ECONOMICS 
Taxing mental health 


Mental equilibrium is essential 
for an economically produc- 
tive life in both industrialized 
and developing countries. 
Accumulating evidence shows 
that mental ill-health and poverty 
tend to be traveling partners, 
but which is the cause? Ridley 
et al. reviewed the literature on 
natural and controlled economic 
experiments involving individu- 
als living in poverty. The authors 
sought to resolve the mecha- 
nisms whereby poverty triggers 
mental illness and how mental 
illness compounds poverty. 
Their results reveal the benefits 
of cash support and of low-cost 
therapeutic interventions for 
those suffering from mental ill- 
ness under poverty. —CA 
Science, this issue p. 1289 


MOLECULAR BIOLOGY 
CiBER-seq dissects 
genetic networks 


Cells integrate environmental 
signals and internal states 

to dynamically control gene 
expression. Muller et al. developed
a technique to dissect this
cellular logic by linking targeted, 
genome-wide genetic perturba- 
tions with a deep-sequencing 
readout that quantitatively mea- 
sured the expression phenotype 
induced by each perturbation. 
The method, dubbed CiBER-seq,
was able to recapitulate
known regulatory pathways 
linking protein synthesis with 
nutrient availability in budding 
yeast cells. Unexpectedly, the 
authors found that the cellular 
logic also appears to consider 
protein production machinery 
in this decision. By uncovering 
additional facets of this deeply 
conserved pathway, the findings 
demonstrate the utility of
comprehensive and quantitative 
CiBER-seq profiling in mapping
the gene networks underlying 
cellular decisions. —SMH 
Science, this issue p. 1290 


DEVELOPMENTAL BIOLOGY 
Trunk formation in a dish 


Building mammalian embryos 
from self-organizing stem cells 
in culture would accelerate the 
investigation of morphogenetic 
and differentiation processes 
that shape the body plan. 
Veenvliet et al. report a method
for generating embryonic trunk- 
like structures (TLSs) with a 
neural tube, somites, and gut 
by embedding mouse embry- 
onic stem cell aggregates in an 
extracellular matrix surrogate. 
Live imaging and comparative 
single-cell transcriptomics 
indicate that TLS formation is 
analogous to mouse develop- 
ment. TLSs therefore provide a 
scalable, tractable, and acces- 
sible high-throughput platform 
for decoding mammalian 
embryogenesis at a high level of 
resolution. —BAP 

Science, this issue p. 1291 


DEVELOPMENTAL BIOLOGY 
Timing and trigger of 
cell polarization 


During mammalian embryo 
development, the first cell fate 
decision separates the pro- 
genitors of the trophectoderm 
(destined to form the placenta) 
from the inner cell mass (which 
forms all tissues of the embryo
and yolk sac). A key event for
this first lineage segregation is
the establishment of apicobasal 
cell polarity. This event is set to 
occur at a fixed developmental 
stage. The factors that trigger the 
establishment of cell polarity as 
well as its temporal regulation 
have remained unknown. Zhu et 
al. show in mouse embryos that 
three molecular regulators— 
Tfap2c, Tead4, and RhoA—are 
sufficient to advance the timing 
of cell polarization with subse- 
quent cell fate specification and 
morphogenesis. —BAP 

Science, this issue p. 1292 
ECONOMICS


Poverty, depression, and anxiety: Causal evidence and mechanisms


Matthew Ridley, Gautam Rao, Frank Schilbach*, Vikram Patel 


BACKGROUND: Depression and anxiety disor- 
ders are together responsible for 8% of years 
lived with disability globally. Contrary to 
widely held preconceptions, these are not 
diseases of affluence. Within a given location, 
those with the lowest incomes are typically 
1.5 to 3 times more likely than the rich to 
experience depression or anxiety. 


ADVANCES: Recent research has established a 
bidirectional causal relationship between pov- 
erty and mental illness. Researchers 
have begun to isolate the underlying 
mechanisms, which can guide effec- 
tive policies to protect the mental 
health of those living in poverty. 

We now know that loss of income 
causes mental illness. Negative in- 
come shocks, such as bad harvests 
due to poor rainfall or job losses due 
to factory closures, worsen mental 
health. Conversely, cash transfers and 
broader antipoverty programs reduce 
depression and anxiety in randomized 
trials. Multiple mechanisms mediate 
this causal chain. Poverty is associ- 
ated with volatile income and ex- 
penditures. The resulting worries and 
uncertainty can worsen mental health. 
Providing health, employment, or 
weather insurance, or other ways of 
smoothing shocks, may thus lower 
depression and anxiety. Living in 
inadequate housing in low-income 
neighborhoods, the poor are also 
more exposed to environmental stresses such 
as pollution, temperature extremes, and chal- 
lenging sleep environments, which can cause 
mental illness. Early-life conditions—poverty 
experienced in childhood and in utero— 
increase the likelihood of poor nutrition and 
other stressors, resulting in impaired cog- 
nitive development and adult mental ill- 
ness. This makes a strong case for providing 
financial support to pregnant women and 
caregivers of young children. Poverty is also 
associated with worse physical health; greater 
exposure to trauma, violence, and crime; and 
lower social status, each of which may af- 
fect mental health. 

Mental illness in turn worsens econo- 
mic outcomes for individuals. Studies show 
that randomized interventions to treat mental
illnesses increase days worked. Depression
and anxiety directly affect the way people 
think, by capturing their attention and dis- 
torting their memory. Such effects are likely 
to influence economic preferences and beliefs 
and thus distort important economic deci- 
sions made by individuals, such as how much 
to work, invest, and consume. Reduced con- 
centration and greater fatigue reduce work 
productivity, and the social stigma of mental
illness may further worsen labor-market
outcomes. Mental illness appears to increase
the likelihood of catastrophic health expendi-
tures for individuals through its comorbidity
with chronic illnesses such as diabetes and
heart disease. Mental illness may also hinder
education and skill acquisition among youth
and exacerbate gender inequalities through
its disproportionate prevalence among women.
Parental mental illness can also influence
children’s cognitive development and educa-
tional attainment, transmitting mental illness
and poverty across generations.


[Schematic with nodes "Poverty" and "Mood and anxiety disorders"]

The causal relationship between poverty and common mental
illnesses. This schematic shows the principal mechanisms we identify,
on the basis of theory and empirical evidence, through which poverty
and depressive and anxiety disorders interact.


OUTLOOK: The burden of mental illness is 
likely to increase in the coming decades. 


Although richer individuals within a given location
are less likely to be mentally ill, richer
countries do not have lower rates of mental 
illness. Thus, aggregate economic growth 
alone is unlikely to reduce mental illness. Cli- 
mate change is likely to worsen mental health, 
both directly through the effect of higher tem- 
peratures on mood and through reductions 
in agricultural yields because of changes in 
rainfall and water supply, more frequent 
weather-related disasters, and an increased 
likelihood of violent conflict. Technological 
change and globalization create large over- 
all economic gains but also concentrated 
groups of losers whose mental health may 
be compromised. The spread of social me- 
dia and associated technologies may also 
be harming mental health, especially among 
adolescents. 

Policy action on mental health is vital, as is 
interdisciplinary research on the mechanisms 
that link poverty and mental illness. Recently 
developed approaches to psychotherapy, de- 
livered through nonspecialist providers, pro- 
vide a scalable and effective approach 
to improving mental health in low- 
income countries. Given the associ- 
ated economic benefits of improved 
mental health, such interventions 
should be a part of the antipoverty 
toolkit alongside more traditional 
economic interventions. Understand- 
ing the most effective combination 
of economic and psychological sup- 
port in different populations is an 
important next step. A priority for re- 
search is testing for a mental health- 
based “poverty trap.” If such poverty 
traps exist, then powerful one-time 
interventions will have large long- 
run effects as gains in mental health 
and economic outcomes reinforce 
one another. Evaluations of economic 
interventions should routinely mea- 
sure mental health, and long-run 
evaluations of mental health inter- 
ventions should measure potential 
impacts on poverty and other key 
economic outcomes. The causal relationship 
between poverty and mental health is even 
more pertinent given the ongoing pandemic, 
which has disproportionately affected the 
poor and may have lasting impacts on their 
economic and mental well-being. A massive 
investment in mental health was already 
long overdue. It has now become critically 
urgent. 
Poverty, depression, and anxiety: Causal evidence and mechanisms


Matthew Ridley, Gautam Rao, Frank Schilbach*, Vikram Patel


Why are people who live in poverty disproportionately affected by mental illness? We review the 
interdisciplinary evidence of the bidirectional causal relationship between poverty and common 
mental illnesses—depression and anxiety—and the underlying mechanisms. Research shows that 
mental illness reduces employment and therefore income, and that psychological interventions 
generate economic gains. Similarly, negative economic shocks cause mental illness, and antipoverty 
programs such as cash transfers improve mental health. A crucial step toward the design of 
effective policies is to better understand the mechanisms underlying these causal effects. 


Depression and anxiety are the most common
mental illnesses: 3 to 4% of the
world’s population suffers from each
at any given time, and they are together
responsible for 8% of years lived with
disability globally (1). Contrary to widely held
preconceptions from the 20th century, these 
are not “diseases of affluence” (2, 3). Within a 
given location, those living in poverty are at 
least as likely to suffer as the rich. By some 
measures, the poor are substantially more 
likely than the affluent to experience mental 
ill-health. Rates of depression, anxiety, and 
suicide correlate negatively with income (4-7) 
and employment (5, 8). Those with the lowest 
incomes in a community suffer 1.5 to 3 times 
more frequently from depression, anxiety, and 
other common mental illnesses than those 
with the highest incomes (5). For example, 
in India, 3.4% of those in the lowest income 
quintile experience depression at any given 
time, compared with 1.9% of those in the 
highest quintile (Fig. 1). 

In this Review, we explore the evidence for 
the bidirectional causal relationship between 
poverty and mental health and its underlying 
mechanisms. Poverty is also correlated with 
poor physical health (9, 10), but the relationship
between mental illness and poverty has
been overlooked and is worth emphasizing. 
Mental health has historically not been con- 
sidered a priority by economists and policy- 
makers, and until recently, mental health 
care had not been evaluated as an antipoverty 
tool. Mental health services are underresourced
relative to physical health systems. On aver- 
age, countries spend 1.7% of their health bud- 
gets on mental health, even though 14% of 
years lived with disability globally are known
to be caused by depression, anxiety, and other
mental illnesses (1). Low- and middle-income
countries spend an even smaller share of al- 
ready small health budgets on mental health 
(Fig. 2). Despite the existence of cost-effective 
treatments, such low investments in mental 
health have contributed to treatment gaps of 
more than 80% globally for common mental 
illnesses, which is much larger than for major 
physical health conditions (11-14). Mental and
physical health are tightly connected: When 
mental health problems coexist with physical 
health problems, health outcomes, disability, 
and costs tend to be much worse (15-17). How- 
ever, unlike most physical health conditions, 
mental disorders may directly distort economic 
decision-making in ways that perpetuate pov- 
erty, by directly affecting cognitive function, 
preferences, and beliefs. 


What are the causal links between poverty
and mental illness? Can economic policies im- 
prove psychological well-being? Can psycho- 
logical interventions reduce poverty? Any 
attempt to understand this relationship must 
acknowledge the complexity and multidimen- 
sional nature of both mental health and 
poverty. Mental health in the broadest possi- 
ble sense has been defined as “a state of well- 
being in which the individual realizes his or 
her own abilities, can cope with the normal 
stresses of life, can work productively and 
fruitfully, and is able to make a contribution 
to his or her community” (18). This definition 
includes both happiness or life satisfaction, 
which also correlate positively with income 
(19), and symptoms associated with anxiety 
and mood disorders, such as depression. The 
two are clearly related; depression and anxiety 
are strong determinants of happiness (20), 
and ultimately, mental health and even men- 
tal illnesses such as depression and anxiety 
exist along a continuum. 

We focus on the causal evidence that links 
poverty with depressive and anxiety disorders, 
the most common mental illnesses, which we 
refer to here using the more general terms 
“mental health” and “mental illness.” Defini- 
tions of these illnesses and a brief primer on 
their measurement are provided in Box 1. 
Although other more serious mental illnesses, 
such as schizophrenia, are also correlated with 
poverty, and may have powerful effects on 
economic outcomes, we do not discuss them 
here (21).

Like mental health, poverty is multidimen- 
sional. We examine causal links between men- 
tal illness and important economic dimensions 
of poverty, particularly income and unemploy- 
ment. We also touch on other dimensions 


Fig. 1. Prevalence of depression by income quintile in India. The average percentage of people in each
income quintile in India who have had depression within the past 2 weeks (“current” prevalence). Error
bars show ±1 SEM. These numbers come from (130), an analysis of the Indian National Mental Health
Survey, 2015-2016.
Fig. 2. Mental health expenditure by country income category. The average percentage of
overall health budgets spent on mental health across countries in each of the four income
categories used by the World Bank. Percent spent on mental health comes from the authors’ own
calculations, using data on overall mental health spending from the WHO Mental Health Atlas 2017
(www.who.int/mental_health/evidence/atlas), with data on total overall health spending from
the WHO's Global Health Expenditure Database (https://apps.who.int/nha/database).


Box 1. Definition and measurement of depression and anxiety. 


Depression, by which we refer here to major depressive disorder, is a constellation of symptoms 
that includes changes in psychomotor function, weight loss, oversleeping or undersleeping, decreased 
appetite, fatigue, difficulty concentrating, extreme feelings of guilt or worthlessness, and suicidal 
thoughts. According to the American Psychiatric Association's Diagnostic and Statistical Manual of 
Mental Disorders (DSM-5), diagnosis of depression requires a set of these symptoms to be present 
over a 2-week period. 

Anxiety, by which we refer here to generalized anxiety disorder, is characterized in the DSM-5 by 
long-lasting and excessive fear and worries over at least a 6-month period, with three or more of the 
following symptoms: restlessness, fatigue, concentration problems, irritability, muscle tension, and 
problems with sleep. Other definitions require the presence of at least one physical symptom such as 
heart palpitations, difficulty breathing, nausea or abdominal distress, dizziness, and numbness. 

Measuring depression and anxiety in large population samples is feasible by using nonspecialist 
surveyors or even through self-administration of questionnaires. Reliable short-form diagnostic tools
can predict professional diagnosis with rates of false positives and false negatives of 10 to 20% and 
have been validated in low-income countries (151-154). Widely used tools include the Generalized 
Anxiety Disorder 7-item (GAD-7) scale for anxiety, the Patient Health Questionnaire (PHQ-9) for 
depression, or the Self-Reporting Questionnaire 20-Item (SRQ-20) scale for any common mental
illness. These scales typically ask respondents how much they experienced symptoms of depression or 
anxiety (such as sadness, lack of concentration, or poor sleep) in the past few weeks. The PHQ-9 and 
GAD-7 ask one question for each of the symptoms that are used to define major depressive disorder 
and generalized anxiety disorder, respectively. In practice, depression and anxiety are correlated, as 
evidenced by the fact that they share some symptoms. 

The Center for Epidemiologic Studies Depression Scale (CES-D) is a popular measure among studies 
of the effect of economic interventions or shocks on mental health. Several studies also use custom 
indices of psychological well-being, typically an average of a life satisfaction scale, a “stress index,” and
some measure of worry or anxiety. In practice, such indices often measure several of the same 
symptoms as the PHQ-9 and GAD-7. 

Some national surveys already include short-form screening tools, such as the UK Longitudinal 
Household Panel Survey and the South Africa National Income Dynamics Study. 


of poverty, including a lack of capabilities 
resulting from low education and physical 
health, as well as relative poverty and asso- 
ciated low social status. Because of a relative 
scarcity of studies, we focus less on the relationship
between mental health and the consumption
of goods and services, which is a
more direct economic measure of poverty. The 
existing evidence of this relationship is con- 
tentious, but the cross-sectional correlation of 
mental illness with consumption appears to be
weaker than that between mental health and 
income (22-25). Income is more volatile than 
consumption in the short run. The stronger 
correlation of mental health with income sug- 
gests that mental health may be more affected 
by short-run changes to economic status than 
long-run or permanent changes. 

We discuss evidence on poverty-alleviation 
programs and mental health treatments ob- 
tained from randomized controlled trials (RCTs). 
These generate variation in individuals’ pov- 
erty and mental health status, respectively, 
that is entirely by chance and therefore un- 
correlated with all other shared risk factors. 
Such studies allow us to isolate evidence of 
causal relationships. We also discuss studies 
of “natural experiments” in which naturally 
occurring variation in economic circumstan- 
ces or mental health is argued to be “as good 
as random.” Examples range from financial 
windfalls, such as lottery wins—for which 
lottery winners may be thought of as a treat- 
ment group and lottery losers as a control 
group—to weather shocks that affect some 
farmers’ incomes more than others’. 
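The logic of randomization can be illustrated with a small simulation. The sketch below is not drawn from any of the studies reviewed here; the effect size, the shared risk factor, and the assignment probabilities are all hypothetical values chosen for illustration. It compares an estimate from randomized assignment with a naive comparison in which an unobserved risk factor influences who receives support.

```python
import random
import statistics


def simulate(n=20000, true_effect=-0.2, randomized=True, seed=0):
    """Estimate the effect of a cash transfer on a depression score.

    Every number here is hypothetical. A negative effect means the
    transfer lowers the depression score.
    """
    rng = random.Random(seed)
    treated, control = [], []
    for _ in range(n):
        # Unobserved shared risk factor that raises depression scores.
        risk = rng.gauss(0, 1)
        if randomized:
            # RCT: a coin flip decides treatment, independent of risk.
            gets_cash = rng.random() < 0.5
        else:
            # Observational setting (assumed): higher-risk people are
            # more likely to end up receiving support.
            gets_cash = rng.random() < (0.8 if risk > 0 else 0.2)
        score = 0.5 * risk + (true_effect if gets_cash else 0.0) + rng.gauss(0, 1)
        (treated if gets_cash else control).append(score)
    # Simple difference in mean scores between the two groups.
    return statistics.mean(treated) - statistics.mean(control)


rct_estimate = simulate(randomized=True)
naive_estimate = simulate(randomized=False)
print(f"randomized: {rct_estimate:.2f}, naive comparison: {naive_estimate:.2f}")
```

Under these assumptions the randomized comparison lands close to the true effect, whereas the naive comparison can even show the wrong sign, because recipients differ systematically from nonrecipients.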


The causal impact of poverty on mental 
ill-health 


Job loss and income declines are drivers of 
poverty and often precede episodes of mental 
illness (26, 27). Evidence from natural exper- 
iments confirms that this relationship is causal. 
For example, reduced agricultural output and 
income because of extreme rainfall caused 
increased rates of depression and suicide in 
rural parts of Indonesia (Fig. 3 and Box 2) (28). 
Similarly, job losses because of plant closures 
in Austria were associated with higher subse- 
quent antidepressant use and mental health- 
related hospitalization (29). Areas in the United 
States more exposed to trade liberalization 
with China saw reduced income and employ- 
ment for some groups of workers and in- 
creased mortality through drug overdoses 
among those same groups (30). Whether 
job loss worsens mental health beyond the 
impacts of the associated loss of income is 
unclear, but both mechanisms are argued 
to play a role in the phenomenon of “deaths 
of despair” (31). Conversely, income or wealth
increases can improve mental health. For ex- 
ample, Native American tribes that opened 
casinos have seen substantial rises in income 
and reductions in anxiety relative to those that 
did not (32). Some studies have shown that 
lottery winners enjoy better mental health 
compared with those who win less or play 
but do not win (33). However, when fully 
controlling for the number and frequency 
of lottery tickets bought, winnings have small 
or no impacts on mental health (34, 35). 
The most compelling causal evidence that 
poverty causes mental illness comes from 
Fig. 3. Cash transfers, suicide rates, and droughts. The estimated effect of the cash transfer roll-out on
district suicide rates, for all districts and separately by whether or not they were experiencing a drought
(bottom 20% of the rainfall distribution) when the cash transfers reached them. Error bars show ±1 SEM.
Asterisks denote a significant difference between effects: ***P < 0.01. 


Box 2. Cash transfers, rainfall shocks, and suicides. 


Christian, Hensel, and Roth examined how income shocks affect suicide rates and depression in 
Indonesia (28). They examined two natural experiments: the staggered roll-out across subdistricts of a 
conditional cash transfer program and annual and spatial variation in rainfall that affects farmers’ 
incomes. They measured depression using a 10-question CES-D scale that is included in the Indonesian 
Family Life Survey. They also used the incidence of suicides, as measured by the reports of village 
leaders in census surveys. 

Subdistricts that received the cash transfer program in the first wave of roll-out saw an 18% drop in 
suicides (P < 0.01) relative to those that received it later, even though both sets of districts had similar 
trends in suicide before the program's start. Meanwhile, rural subdistricts that experienced excess 
rainfall that increased crop yields between census years saw decreases in depression and suicides 
relative to subdistricts experiencing drought. The cash transfer had its largest effects on suicide in 
districts undergoing droughts, suggesting that policy can play a role in mitigating the mental health 
effects of economic shocks (Fig. 3). 

Because suicide was only measured at the subdistrict level in this study, it is not possible to fully 
disentangle the direct effects of the cash transfers on recipients from spillover effects to others in the 
village. This highlights the need for better routine data collection on mental health outcomes alongside 


economic variables. 


RCTs that evaluate antipoverty programs. 
Several studies that evaluated cash transfer 
and broader antipoverty programs have found 
substantial positive impacts on mental health, 
including over long time horizons, after the 
effects of any initial celebratory reactions 
among recipients have worn off. A meta- 
analysis of these studies is provided in Fig. 4. 
For example, cash transfers to Kenyan house- 
holds worth $400 to $1500 at purchasing 
power parity (about 3 to 12 months of house- 
hold income) increased consumption and hap- 
piness while reducing depression, stress, and 
worries (36, 37). Scores on a depression scale
were 0.12 standard deviations (SD; closely 
related to Cohen’s d) lower 4 months after 
completion and 0.16 SD lower after almost
3 years, with larger transfers causing sub- 
stantially larger effects. 
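Effects reported in SD units can be read as a difference in group means scaled by the spread of scores. The sketch below shows two common conventions: scaling by the control-group standard deviation and Cohen's d with a pooled standard deviation. Which convention a given study uses is an assumption here, and the scores are made-up illustrative values, not data from the trials cited.

```python
import statistics


def effect_in_sd_units(treated, control):
    """Difference in means divided by the control-group SD."""
    diff = statistics.mean(treated) - statistics.mean(control)
    return diff / statistics.stdev(control)


def cohens_d(treated, control):
    """Difference in means divided by the pooled SD of both groups."""
    n_t, n_c = len(treated), len(control)
    pooled_var = (
        (n_t - 1) * statistics.variance(treated)
        + (n_c - 1) * statistics.variance(control)
    ) / (n_t + n_c - 2)
    diff = statistics.mean(treated) - statistics.mean(control)
    return diff / pooled_var ** 0.5


# Hypothetical depression-scale scores (lower is better).
control_scores = [10, 12, 14, 16, 18]
treated_scores = [9, 11, 13, 15, 17]

print(round(effect_in_sd_units(treated_scores, control_scores), 3))
print(round(cohens_d(treated_scores, control_scores), 3))
```

When the two groups have similar spreads, as in this toy example, the two measures nearly coincide, which is why an effect in SD units is described as closely related to Cohen's d.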

Similarly, multifaceted antipoverty pro- 
grams beyond cash transfers yield mental 
health benefits. A recent large-scale random- 
ized evaluation of a “graduation program” 
in six countries that provided extremely poor 
participants with a mix of assets, intensive 
training, temporary cash support, savings 
incentives, and help to access health care 
found increases in consumption and assets 
3 years later. The program also improved an 
index of psychological well-being by 0.1 SD, 
which was driven by an increase in happiness 
and a decrease in mental distress (38). Programs
in other settings with similar approaches
have found similar effects (39, 40). Longer-run 
effects of such programs, when measured, 
appear to be even larger for both economic 
outcomes and mental health. In India, for ex- 
ample, an index of psychological well-being 
was 0.24 SD higher in the treatment group 
7 years after the completion of a graduation 
program (41). 

Hence, across a wide range of populations 
and study designs, positive economic shocks 
to individuals are shown to improve mental 
health, whereas negative economic shocks 
undermine mental health. This robust evi- 
dence, on the effects of changes in economic 
circumstances, indicates that poverty does 
cause mental illness. However, with two ex- 
ceptions (35, 41), the above studies consider 
the consequences of changes in economic 
status for a few years at most. An important 
question is whether these short-run effects can 
persist or grow over time. For example, some 
of the causal mechanisms we discuss below 
could take decades to manifest. However, 
there may be a hedonic adaptation effect in 
which mental health eventually adapts to 
the change in circumstances, so that even 
permanent increases in one’s income level 
have a limited long-run effect. Of course, it 
could also be that the positive economic 
shocks are themselves undone by future nega- 
tive shocks, causing mental health to revert 
to initial levels or even worsen. Ongoing
long-run evaluations of cash-transfer prog- 
rams are expected to provide evidence on 
this question (42). 


Mechanisms for poverty causing mental ill-health 


How does poverty cause mental illness? We 
discuss several plausible causal mechanisms 
and the limited evidence for each. The worries 
and uncertainty that come with living in pov- 
erty seem to be an important driver of mental 
illness, as do the effects of poverty on child- 
hood development and one’s living environ- 
ment. We have more limited causal evidence 
for other plausible channels, including wor- 
sening of physical health caused by poverty, 
increased exposure to violence or crime, and 
the effects of low relative social status and 
social isolation. Understanding which of these 
mechanisms are important may have implica- 
tions for policy choices. For example, if worries 
and uncertainty play a major role, then provid- 
ing health and unemployment insurance may 
be crucial, whereas if early-life conditions are 
the key drivers, then cash transfers to parents 
of young children could be the most appropri- 
ate policy response. 


Worries and uncertainty 


The anticipation of economic shocks, not 
just their occurrence, may cause mental ill- 
ness. People living in poverty face substantial 
Study | Country | Outcome | Years elapsed since program start | Years elapsed since program end | Intervention cost ($MER) | Intervention cost ($PPP)

Multifaceted antipoverty programs
Blattman et al. (2019) | Ethiopia | PWB | 5 | 4 | 450 | 1291
Edmonds and Theoharides (2020) | Philippines | PWB | 3 | 0.75 | 184 | 518
Green et al. (2016) | Uganda | APAI-R | 1.3 | - | 874 | 2150
Janzen et al. (2018) | Nepal | PWB | 1.5 | - | 137 | 478
Banerjee et al. (2015) | Multiple | PWB | 3 | 1 | 1467 | 3717
Bandiera et al. (2017) | Bangladesh | PWB | 4 | 2.5 | 302 | 1120
Ismayilova et al. (2018) | Burkina Faso | CES-D | 2 | 1 | 100 | 279
Glass et al. (2017) | DRC | HSCL | 1.5 | - | 40 | 66
Banerjee et al. (2016) | India | PWB | 7 | 5.5 | 357 | 1257
Bedoya et al. (2019) | Afghanistan | PWB | 2 | 1 | 1688 | 6198

Cash transfers
Heath et al. (2020) | Mali | PSS | 3 | - | 649 | 1468
Fernald and Hidrobo (2011) | Ecuador | CES-D | 1.4 | - | 255 | 676
Hjelm et al. (2017a) | Zambia | PSS | 3 | - | 396 | 816
Macours and Vakis (2009) | Nicaragua | CES-D | 0.75 | - | 260 | 758
Kilburn et al. (2019) | South Africa | CES-D | 3 | - | 540 | 870
Baird et al. (2019) | Malawi | GHQ-12 | 4.3 | 2.3 | 180 | 440
Blattman et al. (2017) | Liberia | APAI-R | 1 | 0.8 | 341 | 716
Haushofer et al. (2020) | Kenya | PWB | 1 | 1 | 150 | 338
Blattman, Fiala and Martinez (2019) | Uganda | PWB | 9 | 9 | 382 | 1175
Shapiro (2019) | Kenya | PWB | 0.75 | 0.5 | 122 | 314
Hjelm et al. (2017b) | Zambia | PSS | 3 | - | 432 | 891
Egger et al. (2019) | Kenya | PWB | 1.5 | 1.5 | 1000 | 1871
Baird et al. (2013) | Malawi | GHQ-12 | 2.3 | 0.3 | 180 | 440
Kilburn et al. (2016) | Kenya | CES-D | 4 | - | 960 | 2370
Aguilar (2012) | Mexico | SWB | 5 | 2 | 900 | 1651
Haushofer and Shapiro (2018) | Kenya | PWB | 3.4 | 3 | 521 | 709
Haushofer, Mudida and Shapiro (2020) | Kenya | PWB | 1 | 1 | 534 | 1184
Angeles et al. (2019) | Malawi | CES-D | 2 | - | 156 | 517

Multifaceted antipoverty programs effect (average: 0.138 SD)
Cash transfers effect (average: 0.067 SD)
Overall effect (average: 0.094 SD)
Fig. 4. The impacts of antipoverty programs on mental health. The
estimated treatment effects of RCTs that evaluated antipoverty programs in low-
or middle-income countries on indices of mental health (37-41, 56, 119, 131-149).
Positive treatment effects imply better mental health. “Cash Transfers” refers
to studies of unconditional cash transfers to low-income households, with the
exception of Baird et al. and Kilburn et al., who examined conditional cash 
transfer programs (135, 150). “Multifaceted antipoverty programs” refers to 
interventions that aimed to lift people out of poverty by providing a range of 
elements, typically including asset transfers, skills training, cash support, and access 
to savings and health care opportunities. Treatment effects are expressed in SD 
units. If multiple follow-up measures were available, this figure shows the final 
measure. The outcomes vary across studies and include screening instruments to 
detect common mental illnesses [Short General Health Questionnaire (GHQ-12)] 
and symptoms of depression [Center for Epidemiologic Studies Depression 
Scale (CES-D) and Acholi Psychosocial Assessment (APAI-R)], indices of 
psychological well-being (PWB), and a perceived stress scale (PSS). “Intervention 
cost” refers to the total cost of implementing the intervention, when this is 
available. When implementation costs are unavailable (as with most of the cash 
transfer studies), it refers to the total undiscounted value of the transfer. MER, 
market exchange rates; PPP, purchasing power parity (which adjusts exchange 
rates to reflect the true cost of living). A missing value of years elapsed since 
program end means that the transfer was still ongoing when outcomes were 
measured. A complete description of the methodology of this analysis and details 
on each of the studies is provided in the supplementary materials. 
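
To illustrate what it means for a treatment effect to be "expressed in SD units", the following minimal sketch divides a raw treatment-control difference in means by the control group's standard deviation. The scores are invented, the function name `effect_in_sd_units` is hypothetical, and scaling by the control-group SD is an assumption: it is one common convention, and the individual studies in the figure may standardize differently.

```python
from statistics import mean, stdev


def effect_in_sd_units(treated, control):
    """Difference in group means divided by the control group's sample SD.

    Assumption: the control-group SD is used as the scale; a pooled SD is
    another common choice and would give a slightly different number.
    """
    return (mean(treated) - mean(control)) / stdev(control)


# Hypothetical well-being index scores (higher = better mental health).
control_scores = [10.0, 12.0, 14.0, 16.0, 18.0]
treated_scores = [11.0, 13.0, 15.0, 17.0, 19.0]

effect = effect_in_sd_units(treated_scores, control_scores)
print(f"Treatment effect: {effect:.3f} SD")
```

Because the result is unit-free, effects measured on different instruments (for example CES-D and GHQ-12) can be placed on one axis, which is what allows the studies to be compared in a single figure.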


uncertainty and income volatility and jug- 
gle what are, in effect, complex financial 
portfolios, often without access to formal 
insurance (43). Sustained long-run exposure 
to stress from managing this volatility may 
threaten mental health (44). Consistent with 
this hypothesis, a large-scale randomized ex- 
periment among low-income individuals in 
Oregon found that receiving largely free health 
insurance worth $550 to $750 per year re- 
duced rates of depression by about a quarter 
within a few months (45). This effect did not 
appear to be explained by increased mental 
health care or changes in physical health. Al- 
though the increase in recipients’ effective 
income may have played a role, it represented 
a much smaller relative increase than the cash 
transfer programs described above and yet 
generated a similar effect size on depression. 
Further suggestive evidence for uncertainty as 
a mechanism comes from the small or zero 
effect of wealth shocks on mental health in 
countries with generous and comprehensive 
systems of social insurance, such as Sweden 
(34, 35). 


Environmental factors 


Those living in poverty are generally more 
exposed to environmental irritants posed by 
pollution, temperature extremes, and chal- 
lenging sleep environments (46). Many of 
these factors have been linked directly to men- 
tal illness. Days with extreme heat see worse 
self-reported mental health and increased 
rates of self-harm and suicide (47, 48). Sim- 
ilarly, sleep deprivation is widespread among 
the urban poor in developing countries (49), 
and sleep is thought to be a mechanism affecting 
mental health (50, 51). Some evidence 
exists that clinical interventions to 
improve sleep reduce depression (49, 52). 
Exposure to air pollution is associated with 
living in poverty and may influence mental 
health through multiple channels, such as 
restriction of physical activity or direct 
neurotoxicity (53). Changes in air pollution 
in China have been associated with changes in 
mental health (54). In the United States, ran- 
domly selected low-income households that 
were paid to move to more affluent neighbor- 
hoods saw reductions in depression and anxi- 
ety despite little effect on income (55). However, 
it is not clear whether environmental factors 
or other features of high-income neighbor- 
hoods generated this effect. 


Physical health 


Lower income is robustly associated with 
worse physical health (9). Poverty increases 
exposure to the environmental factors de- 
scribed above and often also implies lower 
access to health care, which increases the 
burden of acute and chronic health condi- 
tions. Worse physical health may affect men- 
tal health through various channels. Chronic 
pain, worries about health and mortality, the 
financial costs of illness, and reduced physi- 
cal activity may all worsen mental health. It is 
therefore unsurprising that physical ill-health 
often co-occurs with depressive and anxiety 
disorders (15). However, only limited causal 
evidence exists of poverty affecting mental 
health through changes in physical health. 
Many of the randomized interventions de- 
scribed in the previous section had no detec- 
table effect on physical health even as they 
reduced mental illness (36, 38, 56). However, 
changes in physical well-being may manifest 
over a longer time frame, which may not be 
captured by these short-run studies. 


Early-life conditions 


Exposure to poverty early in life can threaten 
mental health in later years. Such effects can 
be generated in utero, by exposing pregnant 
women to malnutrition or stress. For example, 
the death of a mother’s relative during preg- 
nancy (compared with after childbirth) pre- 
dicts depression and anxiety among her grown 
children later in life (57). Poverty may also 
disproportionately expose children to adverse 
shocks while their brains are highly plastic. 
Such shocks can profoundly affect brain de- 
velopment, cognitive ability, and mental health 
in adolescence and adulthood (58, 59). Eco- 
nomic stresses around the time of birth also 
have long-term mental health costs; in Ghana, 
a decrease in crop prices by 1 SD at an individual’s 
time of birth was found to increase 
incidence of anxiety or depression in adulthood 
by 50%, an effect associated with maternal nutrition, 
breastfeeding duration, vaccination rates, 
and adult health (60). These results 
imply that programs that provide financial 
support for households with pregnant women 
or young children may have high long-run 
mental health and economic returns. 


Trauma, violence, and crime 


Living in poverty disproportionately exposes 
individuals to crime, including violent offenses 
(61). People living in poverty are also more 
likely to suffer traumatic events such as the 
early deaths of loved ones (62). Likewise, with- 
in the household, women and children in poor 
households are disproportionately affected 
by intimate partner violence (63). The rela- 
tionship between poverty and experiencing 
violence itself may be causal: Cash transfers to 
households reduce intimate partner violence 
(64). In turn, both exposure to violence within 
the household and exposure to violent crime 
in general predict depression and other men- 
tal illnesses (65, 66). Causal evidence on the 
effect of reductions in crime and violence on 
mental illness is needed to shed further light 
on this mechanism. 


Social status, shame, and isolation 


Relative poverty—consumption or income rel- 
ative to others in one’s society—may play a 
role in the relationship between poverty and 
mental illness through the resulting social 
status and interpersonal comparisons. In an 
interesting natural experiment, Norwegian 
tax records were posted online in 2001, mak- 
ing citizens’ income easily searchable. Using 
survey data from 1985 to 2013, a study showed 
that the gap in happiness and life-satisfaction 
between the rich and poor within Norway 
increased sharply once relative income became 
easily visible (67). Although similar causal evi- 
dence is lacking for mental illness, it is plau- 
sible that diminished social status resulting 
from poverty causes or exacerbates depression 
and anxiety. Frequent marginalization of peo- 
ple living in poverty may also result in social 
isolation and loneliness (68), which in turn are 
correlated with depression (69). 


The causal impact of mental ill-health on poverty 


Mental illness predicts worse labor market 
outcomes later in life. After a diagnosis of de- 
pression or anxiety, employment rates and 
incomes have been estimated to fall by as 
much as half, relative to the nondepressed or 
nonanxious (70, 71). Beyond such compari- 
sons, which may be driven in part by unknown 
factors such as physical health, there is little 
evidence from natural experiments to link 
depression or anxiety to incomes. A study 
showed that the approval of lithium for treat- 
ment of bipolar disorder reduced the earn- 
ings penalty associated with bipolar illness 
by a third in Denmark, from 38 to 26%, with 
larger effects in the lower half of the earn- 
ings distribution (72). Studying similar natural 
experiments for depression and anxiety would 
be valuable. 

There is, however, a substantial body of 
experiments that show a causal effect of treat- 
ing mental illness on employment. A meta- 
analysis that aggregated results across 31 RCTs 
in developing countries showed a positive 
average effect of various interventions to 
treat mental illness on labor supply (73). 
Among these interventions, pharmacological 
and psychological treatments had similar po- 
sitive effects on labor supply (0.1 to 0.15 SD), 
and combining both types of treatments had 
even larger effects (0.34 SD). For example, a 
cheap and scalable cognitive behavioral the- 
rapy (CBT) administered in India reduced de- 
pression by 25 percentage points compared 
with that in the control group and in turn 
increased reported days of work by 2.3 days 
per month (Fig. 5 and Box 3). Although these 
studies do not directly show that treatment 
of mental illness reduces poverty rates, higher 
labor supply and earnings naturally reduce 
the likelihood of living in poverty. Whether 
treatment of mental illness has larger long- 
run effects on consumption per unit cost 
than the cash transfers described above is 
unknown (73). 


Mechanisms for mental ill-health causing poverty 


Cognitive function 


Like any illness, depression and anxiety may 
have economic effects because they directly 
reduce individuals’ ability to work. Unlike 
most physical conditions, however, depres- 
sion and anxiety also directly affect the way 
people think. Poverty itself can influence cog- 
nitive function by capturing attention and 
taxing mental bandwidth (74, 75). Mental 
illness may have similar effects, by capturing 
attention, causing excessive rumination, and 
distorting people’s memories and beliefs about 
their abilities (76). Such cognitive impacts 
could alter a range of economic decisions 
and outcomes, from finding jobs to saving 
to education, including by exacerbating “behavioral 
biases” that economists increasingly 
recognize as important (77). For example, 
depressed individuals might avoid making 
active choices and may stick with “default 
options,” may have decreased sensitivity to 
incentives because of anhedonia, or may have 
difficulty choosing among several options. 
Understanding the importance of this mech- 
anism relative to more “direct” economic ef- 
fects through disability or health expenditures 
Fig. 5. Impacts of behavioral activation on depression, labor supply, and health costs. The mean values for the treatment and control groups of depression 
[measured with a Patient Health Questionnaire-9 (PHQ-9) score greater than 10], days unable to work, and health costs (excluding the intervention cost) at 3 months. 
Error bars denote ±1 SEM. Asterisks denote a significant difference between treatment and control after adjusting for covariates: *P < 0.1; ***P < 0.01. 


Box 3. An example of a psychotherapy intervention with positive economic effects. 


Patel et al. conducted an RCT of a brief behavioral activation (BA) therapy program, administered by 
nonspecialist counselors in a sample of 495 depressed adults in Goa, India (155). Compared with a 
control group that received enhanced usual care, treated patients were >60% more likely to be in 
remission 3 months later (64 versus 39%), as measured with a PHQ-9 score (Box 1) below 10, and 
maintained these gains after 12 months. 

Those patients also reported being able to work 2.3 more days per month on average (P = 0.004) 
and reduced health costs, excluding intervention costs, of $20 per month (P = 0.07) (Fig. 5). For 
comparison, a month’s wages for a low-skilled worker in the study context was around $415. Given an 
average intervention cost of $66 per patient, in economic terms the intervention was highly cost- 
effective and may have paid for itself within a few months. After 12 months, the fall in treated patients’ 
health costs alone had already significantly outpaced the cost of intervention, although the difference 
in days worked was no longer significant (94). 


Other evaluations of inexpensive psychotherapies implemented by nonspecialist counselors in low- 
income settings have found similarly large effects on mental health (122). More evidence on the effect 
of such psychological interventions on broader economic outcomes would be highly valuable. For 
example, future trials could be linked to administrative or standardized survey data on wages, earnings, 
and consumption. 
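
The payback arithmetic behind Box 3 can be sketched as follows. The four dollar and day figures are the ones reported in the box; everything else is an assumption made for illustration: the number of working days per month (`WORKING_DAYS_PER_MONTH`), valuing each extra day worked at the low-skilled daily wage, and treating the 3-month benefits as constant over time. The function names are hypothetical.

```python
# Figures reported in Box 3.
EXTRA_WORK_DAYS_PER_MONTH = 2.3
HEALTH_COST_SAVING_PER_MONTH = 20.0  # USD, excluding intervention costs
MONTHLY_WAGE_LOW_SKILLED = 415.0     # USD
INTERVENTION_COST = 66.0             # USD per patient

# Assumption (not from the source): working days in a month.
WORKING_DAYS_PER_MONTH = 22


def monthly_benefit(include_earnings=True):
    """Monthly economic gain per treated patient, in USD."""
    benefit = HEALTH_COST_SAVING_PER_MONTH
    if include_earnings:
        # Assumption: each extra day worked earns the low-skilled daily wage.
        daily_wage = MONTHLY_WAGE_LOW_SKILLED / WORKING_DAYS_PER_MONTH
        benefit += EXTRA_WORK_DAYS_PER_MONTH * daily_wage
    return benefit


def months_to_break_even(include_earnings=True):
    """Months until cumulative benefits equal the intervention cost."""
    return INTERVENTION_COST / monthly_benefit(include_earnings)


print(f"Health-cost savings only: {months_to_break_even(False):.1f} months")
print(f"Savings plus extra earnings: {months_to_break_even(True):.1f} months")
```

Under these assumptions the intervention cost is recovered in a little over 3 months from health-cost savings alone, and sooner once extra earnings are counted, which is in line with the box's statement that the program may have paid for itself within a few months.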


is crucial for correctly measuring the eco- 
nomic burden of mental illness and design- 
ing economic policy for those whose mental 
health is compromised. 


Beliefs 


Beliefs about one’s own and others’ abilities, 
circumstances, and actions are central to eco- 
nomic decision-making. Mental illness may 
distort such beliefs in various ways. Depres- 
sion is associated with more negative beliefs 
about oneself and the external world (78, 79). 
Depressed individuals are more likely to re- 
member negative stimuli and have trouble 
disengaging from negative information once 
it grabs their attention (76). As such, although 
healthy individuals tend to protect overly 
optimistic beliefs about themselves by ignor- 
ing negative information (80), correlational 
evidence suggests that the depressed update 
their beliefs more pessimistically (81). Anxiety, 
meanwhile, is associated with greater 
selective attention toward threatening stimuli 
(82), which could lead to overestimation of 
risks and thus reduced risk-taking. Such evidence 
is consistent with mental illness causing 
pessimistic beliefs, pessimistic beliefs 
causing mental illness, or both. Causal evi- 
dence on how treatment of depression or 
anxiety affects beliefs would help disentangle 
these potential explanations. 


Preferences 


Mental illness may affect economic prefer- 
ences, such as the extent to which people are 
willing to defer gratification (time preferen- 
ces), tolerate risk for higher expected rewards 
(risk preferences), or split rewards between 
themselves and others (social preferences). 
For example, depression may diminish a per- 
son’s patience and altruism. Similarly, anxiety 
disorders may reduce people’s willingness to 
take on even modest levels of potentially pro- 
fitable risk. Such impacts could in turn change 
a variety of economic behaviors, such as la- 
bor supply decisions, savings and investment 
choices, consumption behavior, and the take- 
up of social programs. The limited evidence 
on correlations between mental illness and 
economic preferences is mixed (83-85). 


Labor supply and productivity 


Depression and anxiety often affect individ- 
uals in the prime of their economic lives and 
can be highly recurrent (86). The depressed 
beliefs and distorted preferences described 
above may reduce motivation and labor sup- 
ply. In addition, depression can have a direct 
effect on productivity, such as through in- 
creased fatigue and lack of concentration. 
Depressed individuals may therefore work 
fewer and shorter days and produce less per 
hour (87). Depressed workers might also 
be more easily discouraged during their 
job search or when facing setbacks at work. 
As described above, substantial causal evi- 
dence exists that treatment of mental illnesses 
increases employment (73). However, there 
is little evidence on whether this happens 
through higher at-work productivity, greater 
job search intensity, changed beliefs, or other 
mechanisms. 


Stigma 


Mentally ill individuals contend with sub- 
stantial social stigma and negative stereo- 
typing (88). This may result in discrimination 
in employment (89), which could lower wages 
and limit employment opportunities relative 
to equally productive mentally healthy work- 
ers. On top of this, those living with a mental 
illness are excluded from disability benefit 
schemes in many low-income countries (90). 
More generally, others’ reluctance to inter- 
act socially with mentally ill people (88) may 
exclude them from social networks that pro- 
vide economic opportunities. Stigma may also 
affect the formation and dissolution of house- 
holds in ways that disadvantage the mentally 
ill (91). Depression and anxiety may come with 
a “discount” on the marriage market, causing 
mentally ill individuals to form households 
with less well-off partners, increasing the 
chances of living in poverty. 


Health expenditures 


Mental illness may deepen poverty through 
its impacts on health and health expenditures. 
In developing countries, people living in pov- 
erty usually pay most of their health costs out 
of pocket (92). Globally, 150 million people are 
estimated to have catastrophic health expen- 
ditures each year, which are defined as health 
care payments totaling more than 40% of a 
household’s nonsubsistence expenditures (93). 
Costs associated with treatment of mental 
illness rarely account for large shares of in- 
dividuals’ budgets because most affected 
individuals remain untreated. However, de- 
pression and anxiety frequently co-occur with 
other health conditions (15), and such comor- 
bidity with depression is associated with 
substantially higher health expenditures for 
a range of health conditions (16, 17). Indeed, 
treatment of depression has been found to 
reduce overall health care costs (94). 
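
The definition of catastrophic health expenditure quoted above can be written as a simple check. This is a minimal sketch, not an official implementation: the function name `is_catastrophic` is hypothetical, nonsubsistence expenditure is assumed to be total expenditure minus subsistence expenditure, and the handling of households with no spending above subsistence is a convention chosen here for illustration.

```python
def is_catastrophic(health_payments, total_expenditure,
                    subsistence_expenditure, threshold=0.40):
    """True if health payments exceed `threshold` of nonsubsistence spending.

    Assumption: nonsubsistence expenditure = total - subsistence expenditure.
    """
    nonsubsistence = total_expenditure - subsistence_expenditure
    if nonsubsistence <= 0:
        # Illustrative convention: with no spending above subsistence,
        # any positive health payment counts as catastrophic.
        return health_payments > 0
    return health_payments / nonsubsistence > threshold


# Hypothetical household: 2000 total spending, 1000 of it on subsistence.
print(is_catastrophic(500, 2000, 1000))  # 50% of nonsubsistence spending
print(is_catastrophic(300, 2000, 1000))  # 30% of nonsubsistence spending
```

Measuring payments against nonsubsistence rather than total expenditure means that a given health bill weighs more heavily on a poorer household, whose spending above subsistence is small.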


Women’s empowerment 


The burden of mental illness falls disproportionately 
on women (1). A large-scale (n = 903 
participants) RCT that evaluated a CBT intervention 
for depressed pregnant women in 
Pakistan found a 17% reduction in depression 
rates compared with a control group 7 years 
after the intervention (95). Reduced depression 
among these women was accompanied by a 
0.29 SD increase in economic empowerment, 
as measured by greater control over household 
and personal expenditures. Such impacts 
may have implications for women’s 
consumption and relative poverty within the 
household. 


Intergenerational effects 


Improving a parent’s mental health can bene- 
fit the next generation. In the above study in 
Pakistan, women who had received the inter- 
vention sent their children to better schools 
and had more learning materials in their homes 
(95). Similarly, other RCTs found that treat- 
ment of mothers’ depression improves their 
interaction with their children and their child- 
ren’s mental health (96). Although little direct 
evidence shows that such interventions lead 
to improved educational outcomes or earn- 
ings, there is reason to believe they may. A 
substantial body of work from other con- 
texts shows that early-childhood invest- 
ments have large effects on children’s income 
as adults (97). 


Human capital accumulation 


The onset of common mental illnesses often 
coincides with secondary and tertiary edu- 
cation and the early stages of an adult’s work 
career (86). Mental illness may therefore cause 
long-run economic hardship by reducing school 
and college completion rates, worsening early- 
career job placements and hindering skill 
acquisition (98). This suggests the possibility 
of particularly high economic returns from 
improving mental health among adolescents 
and young adults. Although longitudinal stud- 
ies show a substantial correlation between 
mental illness among students and subsequent 
educational outcomes, there is little experimen- 
tal evidence to date that treatment of depres- 
sion or anxiety among adolescents leads to 
improved educational outcomes (99). 


Outlook 


Having discussed some of the mechanisms 
that influence the relationship between pov- 
erty and mental health, we can speculate more 
broadly on how the relationship between po- 
verty and mental illness may evolve, what this 
means for policy, and what directions may be 
fruitful for research. 


Aggregate economic conditions 


Economic growth and other ongoing global 
trends are unlikely to improve mental health 
by themselves. Higher income causes better 
mental health at the individual level, yet on 
average, the prevalence of mental illness is 
not lower in rich countries. To the contrary, 
existing evidence shows a higher prevalence 
of common mental illness in richer countries 
(Fig. 6) (100). This cross-country difference 
cannot be interpreted causally, and concerns 
remain about differences in methodology, 
diagnosis, or reporting across contexts (101). 
However, one way to reconcile the contrast- 
ing within-country and cross-country relation- 
ships is the possibility that relative, rather 
than absolute, poverty is the more relevant 
cause of mental illness. Risk factors, includ- 
ing inequality and relative poverty, or the 
stresses of urban environments, may deterio- 
rate rather than improve as whole economies 
expand. Within-country inequality has in- 
creased in many countries in recent decades, 
despite substantial reductions in extreme 
poverty and global inequality (102). Com- 
placency about mental health among the 
poor is therefore not warranted even in the 
presence of aggregate economic growth. For 
example, the burden of disease attributable 
to mental and neurological disorders in India 
and China increased by 61 and 28%, respec- 
tively, between 1990 and 2013, despite im- 
pressive economic growth (103). 


Climate change 


The more frequent occurrence of extreme heat 
because of climate change is anticipated to 
exacerbate mental illness directly (104, 105). 
The increased frequency of weather-related 
disasters, such as floods and hurricanes, poses 
a threat to mental health through greater ex- 
posure to trauma (106). Climate change also 
threatens mental health through its negative 
economic consequences, which are likely to 
be more pronounced in low-income countries 
(104). Extreme temperatures during the ag- 
ricultural growing season that damage crops 
and thus economic well-being have been re- 
ported to increase suicides in agricultural 
regions in India (107). Predicted increases in 
water scarcity and droughts are also likely to 
worsen economic and in turn psychological 
well-being. Climate change is also expected to 
lead to increased violence and political conflict 
over the next century through increased pres- 
sure on resources, such as productive land 
and, possibly, psychological effects of heat on 
aggression (108). This combination of econo- 
mic and political consequences of climate 
change may increase the flow of refugees and 
economic migrants, with concomitant challenges 
to mental health (109). 


Technological change and globalization 


For many of those living in poverty across the 
world, technological change and globalization 
offer enormous economic opportunities; how- 
ever, both phenomena produce winners and 
losers. The costs to losers, especially low-wage 
workers in high- and middle-income countries 
who lose jobs as a result of changes in patterns 
of trade or automation, can be long lasting 
and substantial (110), resulting in worse mental 
health (29, 30). Although most economic 
research on these topics focuses on rich coun- 
tries, there is an urgent need to understand 
the mental-health effects of these economic 
changes in poorer countries. 

Offering social insurance and welfare, skills 
training, and job transition programs, includ- 
ing psychotherapies for workers exposed to 
the harmful effects of technological change 
and globalization, will be important to pro- 
tect mental health. One example is the on- 
going Building Bridges and Bonds evaluation, 
which provides a tailored CBT intervention 
to unemployed (formerly incarcerated) men 
in the United States, in conjunction with tra- 
ditional job-seeking services, in an attempt 
to increase employment and wages. 


Pandemics 


Public-health crises such as the ongoing 
COVID-19 pandemic tend to disproportionately 
affect those living in poverty (111). They may 
worsen mental health on average and partic- 
ularly among the poor. Income and employ- 
ment losses as a result of morbidity can be 
large, which in turn can reduce mental health 
through the mechanisms described above. In 
addition, the exposure to trauma, increased 
worries and uncertainty, and worse physical 
health will tend to impair mental health, in 
turn reducing income and employment. How- 
ever, ongoing progress in reducing the burden 
of other diseases that disproportionately af- 
fect the poor—such as cholera and diarrhoeal 
infections, tuberculosis, malaria, and other 
Fig. 6. Prevalence of common mental illnesses by country. For each country, the percentage of the population estimated to have a (left) depressive disorder or 
(right) anxiety disorder is shown at a given point in time against that country’s log gross domestic product (GDP) per capita. Each scatter point represents one 
country. The line shown is an ordinary least squares regression line of country prevalence rates on a constant and log GDP per capita. Prevalence rate data come 
from the Global Burden of Disease Study, 2017 (http://ghdx.healthdata.org/gbd-results-tool). GDP per capita data are for 2017, measured in constant 2017 
international $, and come from the World Bank’s World Development Indicators dataset (https://databank.worldbank.org/source/world-development-indicators). 
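
The regression line described in this caption, country prevalence on a constant and log GDP per capita, can be sketched with ordinary least squares as below. This is an illustrative sketch only: the four data points are invented rather than taken from the Global Burden of Disease or World Bank data, the helper name `ols_fit` is hypothetical, and the natural logarithm is assumed because the caption does not state the base.

```python
import math


def ols_fit(x, y):
    """Return (intercept, slope) from an OLS regression of y on a constant and x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


# Hypothetical countries: (GDP per capita in international $, prevalence in %).
countries = [(1_000, 3.0), (5_000, 3.4), (20_000, 3.9), (60_000, 4.3)]

# Assumption: natural log of GDP per capita.
log_gdp = [math.log(gdp) for gdp, _ in countries]
prevalence = [p for _, p in countries]

intercept, slope = ols_fit(log_gdp, prevalence)
print(f"prevalence = {intercept:.2f} + {slope:.2f} * log(GDP per capita)")
```

A positive slope in such a fit corresponds to higher measured prevalence in richer countries. As the main text stresses, this cross-country pattern cannot be interpreted causally.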


insect-borne diseases—will provide a counter- 
vailing force likely to improve mental health 
among the poor. The COVID-19 pandemic has 
the potential to set back progress in reducing 
the enormous burden of these diseases. 


Social media 


The spread of mobile phones and the internet 
opens up opportunities for poverty alleviation 
(112) and new ways to deliver mental health 
care. However, some of these technologies 
may pose new threats to mental health. Al- 
though more causal evidence is needed, some 
studies have found that depression is corre- 
lated with internet addiction and with the in- 
tensity of use of social media among young 
adults and adolescents (113, 114). Recent ran- 
domized interventions show that deactivat- 
ing social media accounts for 4 weeks led to 
0.1 SD reductions in depression and anxiety 
scores (115, 116). Increased access to the inter- 
net and social media among the global poor, 
although presenting enormously valuable be- 
nefits, may therefore also pose some threat to 
mental health among the poor (117). 


Implications for research and policy 


Because mental health and poverty are inti- 
mately linked, interdisciplinary collaborations 
between mental-health researchers and social 
scientists studying poverty are essential (Box 4). 
Evaluations of economic interventions should 
carefully measure impacts on mental health, 
and vice versa. An example of such work is 
Bhat et al., in which a team of psychiatrists 
and economists followed up on psychotherapy 
clinical trials in India and deployed the tools of 
behavioral economics and psychiatry to study 
long-run effects of psychotherapy on mental 
health, economic well-being, and decision- 
making (118). 


Policy tools 


There is a strong economic case for investing 
in the mental health of people in poverty. A 
recent meta-analysis showed that mental health 
interventions in low- and middle-income coun- 
tries, including psychotherapy and pharma- 
cotherapy, had similar or larger effects on 
employment than economic interventions 
such as cash transfers, job training programs, 
and multifaceted antipoverty programs (73). 
Yet, they were an order of magnitude less ex- 
pensive to deliver. Such treatments could there- 
fore be the most cost-effective antipoverty 
intervention, at least among the subset of 
people who are mentally ill. However, we know 
little about how to optimally combine, dose, se- 
quence, and target economic and mental health 
interventions. For example, combining psycho- 
logical and economic interventions may lead to 
treatment effects that are greater than the sum 
of their parts. In particular, improved mental 
health could increase the economic returns of 
receiving cash or asset transfers by improving 
decision-making and productivity. 

Recently, innovative studies have compared 
the effects of providing psychotherapy, cash 
support, or both among low-income popula- 
tions. An RCT measured the effect of 8 weeks 
of CBT and/or $200 in cash support to 999 
criminally engaged men in Liberia (119). Al- 
though the psychotherapy targeted antisocial 
behavior rather than mental illness per se, the 
study found that the combination of cash trans- 
fer and psychotherapy improved an index of 
self-regard and mental health by 0.2 SD a year 
later (P = 0.024), accompanied by a modest 
reduction in depression and psychological dis- 
tress (-0.11 SD, P = 0.24). The combined treat- 
ment not only reduced antisocial behavior but 
also increased patience and self-control, whereas 
neither cash nor therapy alone had detectable 
effects. However, none of the treatments ap- 
parently influenced consumption or income a 
year later. More studies along these lines would 
be valuable. 


Treatment gaps 


In poor countries, the fraction of individuals 
diagnosed with depression and anxiety who 
do not receive treatment often exceeds 90% 
(11-14). Such treatment gaps likely result from 
a combination of poor supply and low demand 
for mental health services. 


Increasing supply 


Resources for mental health care are extremely 
limited in low-income countries (Fig. 7), and 
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Box 4. Priorities for future research on poverty and common mental illnesses. 


1. Measurement of mental health in economic surveys to estimate 


The comparative impacts of diverse economic interventions, such as insurance, social safety, 
and employment support, relative to cash transfers 


The longer-run effects of antipoverty programs 
The effects of absolute versus relative poverty 


The effect of technological change and globalization on high and low-wage workers 
The impact of the looming global economic recession resulting from COVID-19 


2. Measurement of economic outcomes in intervention studies for depression and anxiety, 


which includes 


Income, labor supply, productivity, and profits from self-employment 
Economic preferences and beliefs; investment and savings behaviors 
Household expenditures and consumption, including within-household allocation of resources 


3. Evaluations of interventions to reduce stigma and to boost demand for mental health care, 


which includes 


Diverse mental health literacy approaches, from mass-media campaigns to grass-root awareness building 
Subsidies and incentives for seeking and engaging with mental health care 
The effects of interventions on marginalized and underserved communities 


4. Evaluation of technologies to increase the coverage of effective psychotherapies, 


which includes 
Text, phone, or video delivery 


Digital approaches to training, supervision, and quality assurance for frontline providers 
Artifical intelligence bot-based and other guided self-help approaches, adapted to different languages and 


cultural contexts 


5. Evaluation of interventions to interrupt the intergenerational transmission of poverty and 


mental illness, such as through 


School mental health-promotion programs and measurement of effects also on educational attainment, 


labor supply, productivity, and earnings 


Treatment of parental mental illness and measurement of effects on children’s cognitive and educational 


outcomes 


people living in poverty often lack access 
to basic mental health care (120). For ex- 
ample, some estimates suggest that India 
has only 3900 psychiatrists for more than 
1.3 billion people, and 13 psychiatrists serve 
Zimbabwe's 14 million people (121). However, 
cost-effective and scalable strategies for treat- 
ing mental illness in low-resource settings 
do exist. Evidence from multiple countries 
shows that “psychosocial” treatments such 
as manualized talk therapies can be highly 
effective at low cost, even when delivered 
by nonspecialist community health work- 
ers (122, 123). An even more pared-down— 
but still effective—approach is the “friendship 
benches” of Zimbabwe, in which nonspecialist 
health workers (popularly known as “commu- 
nity grandmothers”) provide problem-solving 
therapy with components of behavior activa- 
tion to patients (124). 
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Reliance on in-person training and super- 
vision of the community health workers by 
experts is a major barrier to scaling these 
approaches. The Empower initiative seeks to 
address these structural barriers through the 
deployment of digital platforms to enable 
frontline workers to learn, deliver, and master 
evidence-based brief psychological treatments 
for mental health problems, using innovative 
practices such as coach-supported learning and 
peer-supervised quality assurance, all delivered 
through digital tools (225). Empower will roll 
out the brief behavioral activation treatment 
developed and evaluated in India and Nepal 
(Box 3) to community health workers in India 
and the United States from 2021 onward. 


Stimulating demand 


Even in settings with affordable and effective 
mental health services, many people do not 
seek or adhere to treatment (126). People 
often lack mental health literacy and may 
not possess basic information about mental 
health conditions and their risk factors, symp- 
toms, and potential treatment options (127). 
Stigma and shame can further depress de- 
mand for mental health services. But there 
are examples of successful community-based 
programs that increase mental-health liter- 
acy and boost the share of mentally ill indi- 
viduals who seek treatment; for example, the 
VISHRAM (Vidarbha Stress and Health Pro- 
gramme) initiative, rolled out in 30 villages 
in rural India, deployed “change agents”— 
typically, persons already playing leadership 
roles in the villages—to engage the commu- 
nity in conversations around mental health 
and illness, using a range of contextually ap- 
propriate strategies such as group discussions; 
the program led to a sixfold increase in help- 
seeking by persons with depression in just 
18 months (128). A priority for future work 
should be the evaluation of such programs 
at scale, as well as testing approaches that 
bundle mental health treatments with other un- 
stigmatized services, subsidizing or even reward- 
ing the take-up of treatment, or using remote 
technologies such as app-based therapy that 
are less likely to expose individuals to stigma. 


Poverty traps 


A classic idea in development economics is 
the existence of a “poverty trap” (129). This is 
the idea that below a certain threshold level 
of wealth, people are not able to earn and save 
their way out of poverty. They remain trapped 
in poverty precisely because of the depriva- 
tions associated with poverty and not because 
of any intrinsic lack of ability. Such poverty 
traps could exist for many reasons. For exam- 
ple, the very poor may not be able to afford 
enough food to be productive at work. Re- 
cent evaluations of multifaceted antipoverty 
programs have provided some evidence for 
the existence of such poverty traps (39, 41). 
However, the underlying mechanisms are un- 
clear. The evidence of a bidirectional causal 
relationship between poverty and mental health 
presented in this Review suggests that mental 
health could be a key mechanism: There could 
be psychological poverty traps. Some of those 
living in poverty may be ensnared in a vicious 
cycle of poverty and mental illness (5). If this 
is the case, a one-time economic or psycho- 
logical intervention of sufficient magnitude 
may “push” people into a state of sustainably 
higher income and better mental health. 
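
The threshold logic of a poverty trap can be illustrated with a deliberately stylized simulation. The Python sketch below assumes a hypothetical two-regime model in which wealth drifts toward a low steady state below a threshold and toward a high steady state above it. The function names and all parameter values are illustrative assumptions, not estimates from this Review or from the cited evaluations.

```python
def next_wealth(wealth, threshold=1.0, low=0.5, high=2.0, speed=0.5):
    """One period of a stylized threshold model (all parameters hypothetical).

    Below the threshold, wealth drifts toward the low steady state;
    above it, wealth drifts toward the high steady state.
    """
    target = high if wealth > threshold else low
    return wealth + speed * (target - wealth)


def simulate(wealth, periods=50):
    """Iterate the model for a fixed number of periods and return final wealth."""
    for _ in range(periods):
        wealth = next_wealth(wealth)
    return wealth


start = 0.9  # starts just below the threshold
without_transfer = simulate(start)           # stays trapped near the low state
small_transfer = simulate(start + 0.05)      # still below the threshold
large_transfer = simulate(start + 0.3)       # one-time push across the threshold

print(round(without_transfer, 3), round(small_transfer, 3), round(large_transfer, 3))
```

In this toy model, only a one-time transfer large enough to cross the threshold changes the long-run outcome, which is the sense in which an intervention "of sufficient magnitude" matters.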


Conclusion 


The causal relationship between poverty and 
mental health we have described could not be 
more pertinent than in the ongoing pande- 
mic, which has already adversely affected both 
of these outcomes. Given the surge of deaths 
of despair in the United States in the wake of 
the Great Recession of 2008 (31), we have 
grave concerns about the mental-health impli- 
cations of the economic recession confronting 
the world. The pandemic has disproportion- 
ately affected the poor and may have lasting 
adverse impacts on their economic and men- 
tal well-being. A massive investment in mental 
health was long overdue even before the pan- 
demic and has become critically urgent now. 
Beyond more money, this is also an important 
opportunity to invest wisely in lower-cost in- 
novations that provide quality care to low- 
income and disadvantaged communities and 
to integrate economic interventions with men- 
tal health care to reduce historic disparities in 
both wealth and mental health. 


Fig. 7. Availability of mental health workers across countries. The mean numbers of psychiatrists, social 
workers, psychologists, and nurses working in the mental health sector (per 100,000 people) are shown 
for countries in each of the four income categories used by the World Bank. Data on mental health workers 
come from the WHO's Global Health Observatory (http://apps.who.int/gho/data/node.main.MHHR?lang=en) 
and are for the most recent year available (which ranges from 2013 to 2017). 


CiBER-seq dissects genetic networks by quantitative 
CRISPRi profiling of expression phenotypes 

INTRODUCTION: Systematically profiling the 
effects of genetic perturbations is a powerful 
approach that has revealed the molecular 
basis for a wide range of biological phenomena. 
The simple, programmable DNA recognition 
of CRISPR-Cas9 enables genome-wide genetic 
analysis in human cells and many other sys- 
tems. Cas9 is guided by a short RNA to a com- 
plementary sequence in the genome, where it 
can introduce mutations or alter gene expres- 
sion. Pooled libraries of guide RNAs (gRNAs) 
that individually target each gene in the genome 
allow us to introduce genetic perturbations 
effects caused by individual guides in these 
pooled libraries and linking these phenotypes 
back to the associated gRNA, thereby finding 
the gene that is responsible. 


RATIONALE: Molecular phenotypes such as 
gene expression changes provide a clear and 
sensitive measure for many cellular processes. 
We sought a general approach to profile how 
the expression of a particular gene of interest 
changed when other genes were perturbed. 

CRISPRi with barcoded expression reporters. CRISPR-Cas9 gRNA cassettes are linked with transcriptional 
reporters containing specific barcodes. The RNA-to-DNA ratio for each barcode, measured by deep 
sequencing, reveals the reporter expression phenotype induced by each gRNA. 

We began with a library of gRNAs, each dis- 
rupting one gene, and linked these guides with 
an expression reporter containing a guide- 
specific nucleotide barcode. gRNAs that alter 
reporter expression will change the abundance 
of the expressed RNA barcode specifically 
associated with that guide. Deep sequencing 
of these expressed barcodes quantifies each of 
these guide-specific reporter expression effects 
individually within a pooled, complex popu- 
lation. We have implemented this strategy 
by combining CRISPR interference (CRISPRi) 
with barcoded expression reporter sequencing 
(CiBER-seq). 


RESULTS: We used CiBER-seq to profile the 
responses of several yeast promoters tied to a 
range of biological functions. Each promoter 
yielded a distinct pattern of responses that 
could be understood in terms of its known 
function and regulation. For example, we re- 
discover the control of MET6 expression by 
regulatory ubiquitylation and connect the bud 
scar protein Cwp1 to other genes required for 
budding and cytokinesis. Our analysis of the 
HIS4 promoter, a well-characterized target of 
the integrated stress response, yielded a range 
of genetic perturbations that activate this path- 
way by causing the accumulation of uncharged 
transfer RNAs (tRNAs). We also uncovered a 
notable role for tRNA depletion in this re- 
sponse, as impaired tRNA biogenesis activated 
HIS4 expression through a distinct pathway. 
In order to understand this regulation, we car- 
ried out genetic interaction analysis and looked 
for quantitative deviations in CiBER-seq pro- 
files caused by the introduction of a second 
genetic perturbation. We also developed an 
indirect CiBER-seq approach to measure trans- 
lational and posttranslational regulation, which 
both play roles in the signaling pathways up- 
stream of HIS4. 


CONCLUSION: CiBER-seq produces compre- 
hensive phenotypic profiles that offer insights 
into gene function and regulation. These high- 
throughput and quantitative phenotypic mea- 
surements are also well suited for the systematic 
measurement of genetic interactions, which 
contain rich information about the operation 
of biological processes. This approach can be 
applied to study a wide range of transcrip- 
tional, translational, and posttranslational reg- 
ulatory responses, and it has the potential to 
shed light on many areas of biology. 


CiBER-seq dissects genetic networks by quantitative 
CRISPRi profiling of expression phenotypes 

To realize the promise of CRISPR-Cas9-based genetics, approaches are needed to quantify a specific, 
molecular phenotype across genome-wide libraries of genetic perturbations. We addressed this 
challenge by profiling transcriptional, translational, and posttranslational reporters using CRISPR 
interference (CRISPRi) with barcoded expression reporter sequencing (CiBER-seq). Our barcoding 
approach allowed us to connect an entire library of guides to their individual phenotypic consequences 
using pooled sequencing. CiBER-seq profiling fully recapitulated the integrated stress response 
(ISR) pathway in yeast. Genetic perturbations causing uncharged transfer RNA (tRNA) accumulation 
activated ISR reporter transcription. Notably, tRNA insufficiency also activated the reporter, independent 
of the uncharged tRNA sensor. By uncovering alternate triggers for ISR activation, we illustrate how 
precise, comprehensive CiBER-seq profiling provides a powerful and broadly applicable tool for 
dissecting genetic networks. 


CRISPR-Cas9 has emerged as a powerful 
and versatile tool for creating precise, 
programmable genetic perturbations 
(1). CRISPR-based knockout (2-4) and 
knockdown (5) approaches enable sys- 
tematic, genome-wide genetic analysis in a 
wide range of cells and organisms. The Cas9 
protein binds short RNAs that guide this 
protein-RNA complex to complementary sites 
in the genome, where it can induce mutations 
or silence promoters (1). Libraries of guide 
RNAs (gRNAs), each targeting one individual 
gene, can be used to create a population of 
cells that each express one distinct guide (6-8). 
The phenotype of each cell then reflects the 
impact of the single guide that it expresses. 
It is straightforward to identify guides that 
affect cell survival or proliferation, but growth 
is a crude phenotype that is poorly suited to 
address many important biological questions 
(9). The scope of CRISPR-based genetics would 
be expanded by improved techniques to mea- 
sure more-specific and relevant phenotypes 
across this diverse population and link these 
measurements back to individual guides. 
Molecular phenotypes, such as the expres- 
sion level of a critical gene or the stability of a 
key protein, provide a focused and sensitive 
gauge for many aspects of cell physiology. We 
devised an approach for profiling a transcrip- 
tional, translational, or posttranslational regula- 
tory response comprehensively across CRISPR- 
based perturbations genome-wide. We adapted 
barcoded expression reporters (10) to produce 
quantitative phenotypic profiles from bulk se- 
quencing of highly diverse populations. These 
profiles also enabled high-precision genetic 
interaction analyses, which use double-mutant 
phenotypes to map the structure of regulatory 
networks (11). This direct sequencing approach 
offers substantial advantages over fluorescent 
reporters for CRISPR-based genetics. Fluores- 
cence phenotypes are typically analyzed by cell 
sorting (9), which imposes bottlenecks on the 
cell population size and discretizes quanti- 
tative fluorescence measurements into a few 
broad gates. Our approach circumvented both 
of these limitations. It also complemented 
broader expression profiles from single-cell 
approaches such as Perturb-seq (12, 13), CRISP- 
seq (14), and CROP-seq (15), which cannot 
currently capture enough cells to approach 
genome-scale coverage. Thus, better ways to 
profile molecular phenotypes across genome- 
scale guide libraries stand to benefit many 
areas of biology. Here, we combined CRISPR 
interference (CRISPRi) with barcoded expres- 
sion reporter sequencing (CiBER-seq) to measure 
cellular responses provoked by gRNA-mediated 
knockdown. 


Results 

Barcoded expression reporters linked 
transcriptional responses with gRNA-mediated 
perturbations in massively parallel screens 


The development of CiBER-seq relied on mas- 
sively parallel measurements of reporter ex- 
pression in a diverse population by the deep 
sequencing of short-sequence barcodes embed- 
ded in the reporter transcript (10). Each bar- 
code was linked to one gRNA, and the RNA 
abundance of each barcode reflected reporter 
expression levels in the cells containing that 
barcode and expressing the associated guide 
(fig. S1). Barcode RNA levels are also affected 
by the variable abundance of cells containing 
the barcode and its associated guide and by 
nonspecific disruptions of cell viability or 
transcription. To correct for these confound- 
ing effects, we paired each barcoded reporter 
with a barcoded control transcript driven from 
a housekeeping promoter, which will experi- 
ence the same overall cellular environment. 
We implemented CiBER-seq in budding yeast 
and delivered guide-reporter pairs on autono- 
mous low-copy plasmids. We tagged each guide 
sequence in a comprehensive yeast gRNA 
library with pairs of random nucleotide barcodes 
and determined the linkage between barcode 
pairs and guides by long-read next-generation 
sequencing (Fig. 1A). Our library contained 
10 guides per gene (~60,000 in total) (16) and 
~240,000 distinct barcode pairs (approximately 
four per guide on average). By linking multiple 
barcodes with each guide, we were able to 
obtain independent measurements of guide 
effects in a single experiment. 

We drove gRNA expression from a tetracycline- 
inducible promoter (17), which allowed us to 
specifically measure the difference in barcode 
expression before and after guide induction. 
This experimental design allowed us to exclude 
technical effects resulting from the sequence of 
a barcode. Inducible CRISPRi also facilitated 
the measurements of guides with strong fitness 
effects by allowing us to propagate cells without 
guide induction and thereby avoid the pre- 
mature loss of guides with growth defects. 
We analyzed the multifactorial barcode abun- 
dance data in a generalized linear model frame- 
work, implemented by a massively parallel 
reporter assay linear model (mpralm) (18), 
which estimates the change in reporter ex- 
pression caused by CRISPRi guide induction 
while controlling for baseline reporter expres- 
sion and changes in the paired housekeeping 
barcode. This framework allowed us to incor- 
porate replicate measurements, and we carried 
out all CiBER-seq experiments in biological 
duplicate. It also allowed us to identify guides 
causing a statistically significant change in 
normalized reporter expression, using statisti- 
cal approaches developed for gene expression 
measurements in limma (linear models for 
microarray data) (19). 
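
The normalization logic described above can be illustrated with a much simpler point estimate than the mpralm model used in the study. The Python sketch below is not mpralm or limma, and it ignores replicates and variance modeling; it only shows how the change in a reporter barcode after guide induction can be expressed relative to the change in its paired housekeeping barcode. The function names, the pseudocount, and the read counts are hypothetical.

```python
import math


def log2_change(pre, post, pseudocount=1.0):
    """log2 ratio of post- to pre-induction counts, with a pseudocount to avoid log(0)."""
    return math.log2((post + pseudocount) / (pre + pseudocount))


def normalized_reporter_change(rep_pre, rep_post, ctrl_pre, ctrl_post):
    """Reporter log2 change minus the paired control-barcode log2 change."""
    return log2_change(rep_pre, rep_post) - log2_change(ctrl_pre, ctrl_post)


# Hypothetical counts: the reporter barcode rises fourfold, the control is flat.
specific = normalized_reporter_change(99, 399, 199, 199)

# Hypothetical counts: both barcodes halve, as expected for a guide that simply
# reduces cell abundance or global transcription.
nonspecific = normalized_reporter_change(99, 49, 199, 99)

print(specific, nonspecific)
```

In this toy example the first guide shows a specific effect on the reporter, whereas the second shows none once the shared change in the control barcode is subtracted.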

To explore whether CiBER-seq could dissect 
a range of regulatory circuits, we profiled the 
responses of four promoters, P(MET6), P(CWP1), 
P(PHO5), and P(HIS4), whose gene products 
encompass a wide range of cellular roles. The 
protein product of MET6 catalyzes the conver- 
sion of homocysteine to L-methionine and plays 
a central role in sulfur and one-carbon metab- 
olism. The activity of P(MET6) is controlled by 
the transcription factor Met4, which is ubiq- 
uitinated and inactivated by the SCF^MET30 
complex (20) (Fig. 1B). In agreement with the 
inhibitory effect of SCF^MET30, CiBER-seq analysis 
of P(MET6) regulation identified several guides 
targeting this complex that increased P(MET6)- 
driven barcode expression relative to P(PGK1) 
controls (Fig. 1, B and C, fig. S2A, and data S1). 
We found a 32-fold enrichment (q < 1.3 x 107”) 
for this functional category among guides ac- 
tivating P(MET6), spanning nearly every sub- 
unit of this complex. 
CiBER-seq analysis indicated that P(CWP1) 
activity was increased by genetic perturbation 
of cytokinesis. The Cwp1 protein localizes to 
bud scars, the cell wall structures formed at 
the site of cytokinesis in budding yeast (21). 
Appropriately regulated transcription of CWP1 
is important for this localization, although the 
mode of its regulation is unknown. We found 
that gRNAs targeting genes involved in budding 
and cytokinesis, including septins, myosin, and 
chitin synthase (22), all activated P(CWP1) (Fig. 
1D and data S1). We validated that knockdown 
of three representative genes, MLC1, CDC10, and 
CHS2, all induced the endogenous CWP1 gene 
(Fig. 1E). Thus, P(CWP1) is activated by a coherent 
set of gRNAs that is consistent with the link 
between Cwp1 and budding. 


Fig. 1. Barcoded expression reporters linked transcriptional responses with gRNA-mediated 
perturbations in massively parallel screens. (A) Schematic of CiBER-seq profiling experiment. 
Barcode expression is driven by a query promoter or matched P(PGK1) normalizer. Changes in relative 
barcode expression after gRNA induction link individual gRNAs with their corresponding phenotypic 
effect. (B) Model diagram of the SCF^MET30 complex. Subunits with at least one significant guide 
[q < 0.05 and >1.5-fold P(MET6) increase] are colored green. (C) Genome-wide CiBER-seq profile of 
P(MET6) transcription changes upon gRNA induction, relative to P(PGK1). Each point represents one 
gRNA, with guides against SCF^MET30 complex or MET6 itself colored as indicated. Lines indicate 
cutoffs for significant (q < 0.05) and substantial (>1.5-fold change) effects. (D) Genome-wide 
CiBER-seq profile of P(CWP1) transcription, as in (C), with guides against cytokinesis GO term genes 
and CWP1 itself colored as indicated. Significant guides against cytokinesis genes MLC1, CDC10, and 
CHS2 are labeled; the guide with the strongest q value was selected for validation. (E) Endogenous 
CWP1 mRNA measurements before and after MLC1, CDC10, and CHS2 knockdown. Error bars are the 
standard deviation across N = 3 biological replicates. (F) GO analysis of each CiBER-seq query 
promoter profile. Guides were filtered by q < 0.05 and >2-fold increase, and resulting gene lists 
were analyzed for GO category overrepresentation using Fisher's exact test with false discovery rate 
(FDR)-adjusted P < 0.05. The most statistically significant entry was chosen from chains of 
hierarchically nested categories, and all chains with significant categories for any promoter are 
represented in the plot. 
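
The GO overrepresentation analysis summarized for Fig. 1F combines Fisher's exact test with FDR adjustment. As a rough illustration only, and not the authors' pipeline, the Python sketch below computes a one-sided Fisher's exact test through the hypergeometric tail and applies a Benjamini-Hochberg adjustment using only the standard library. Benjamini-Hochberg is an assumption here, because the source does not name the FDR procedure; the function names and toy numbers are likewise hypothetical.

```python
from math import comb


def fisher_enrichment_p(hits_in_cat, hits_total, cat_total, universe):
    """One-sided Fisher's exact P value for overrepresentation.

    Probability of drawing at least `hits_in_cat` category members when
    `hits_total` genes are sampled from `universe` genes, of which
    `cat_total` belong to the category (hypergeometric upper tail).
    """
    max_k = min(hits_total, cat_total)
    tail = sum(
        comb(cat_total, k) * comb(universe - cat_total, hits_total - k)
        for k in range(hits_in_cat, max_k + 1)
    )
    return tail / comb(universe, hits_total)


def benjamini_hochberg(pvals):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    n = len(pvals)
    order = sorted(range(n), key=lambda i: pvals[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


# Toy example: all 3 hit genes fall in a 3-gene category within a 10-gene universe.
p_toy = fisher_enrichment_p(hits_in_cat=3, hits_total=3, cat_total=3, universe=10)
q_toy = benjamini_hochberg([0.01, 0.04, 0.03])
print(p_toy, q_toy)
```

In practice, each GO category would receive its own P value from the first function, and the full list would then be adjusted together before applying a cutoff such as the FDR-adjusted P < 0.05 stated in the legend.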

Each of the four promoters displayed a 
distinctive and largely nonoverlapping phe- 
notypic profile consistent with their known 
regulation and function (Fig. 1F and tables 
S1 to S4). CiBER-seq profiling of P(PHO5), a 
model promoter for early studies of chromatin- 
based transcription regulation (23), identified 
a collection of genes involved in nucleosome 
remodeling and RNA polymerase II initiation 
(fig. S2B and data S1). Similarly, P(HIS4) CiBER- 
seq yielded a complex profile of responses, in- 
cluding activation of P(HIS4) by knockdown of 
aminoacyl-tRNA synthetases (fig. S2C and data 
S1). The HIS4 gene, which encodes an amino 
acid biosynthetic enzyme, is a well-characterized 
transcriptional target of the yeast integrated 
stress response (ISR), also known as the general 
amino acid control (GAAC) response. This deeply 
conserved pathway up-regulates biosynthetic 
genes in response to elevated levels of un- 
charged tRNAs that arise during amino acid 
starvation (24). It is therefore easy to under- 
stand how directly impairing tRNA charging 
by knockdown of the synthetase enzyme that 
carries out this reaction would trigger the ISR 
and thereby activate P(HIS4) as well as P(MET6), 
which is also an ISR transcriptional target. 

We also observed clear P(HIS4) and P(MET6) 
activation from guides targeting RNA polymer- 
ase III, a scenario that should result in the 
absence of tRNA rather than the accumulation 
of uncharged tRNA. Because these uncharged 
tRNAs are sensed by the Gcn2 kinase, we in- 
vestigated how the CiBER-seq profile of P(HIS4) 
would differ in a gcn2Δ knockout strain. Dele- 
tion of this sensor kinase eliminated the re- 
sponse of P(HIS4) to guides against tRNA 
synthetases, whereas the effects of RNA poly- 
merase III knockdown remained at least as 
strong (Fig. 1F, fig. S2D, and data S1). Because 
we did not observe any effect of these guides 
on P(PHO5) or P(CWP1), it seemed unlikely 
that our results reflected a change in the ac- 
tivity of the P(PGK1) promoter that we used 
as a common point of reference in these 
experiments. Nonetheless, we set out to profile 
P(HIS4) activity more directly, normalizing 
the RNA expression level of the barcode against 
the DNA abundance rather than a control 
promoter. 


CiBER-seq recapitulated known genetic 
regulators of ISR and identified regulators 
related to tRNA insufficiency 


CiBER-seq profiling of P(HIS4) and P(PGK1) 
individually, with normalization against bar- 
code DNA abundance (Fig. 2A and fig. S3), 
allowed us to unambiguously attribute observed 
transcriptional responses to one promoter. Our 
unbiased gene ontology (GO) analysis captured 
specific and distinct categories of P(PGK1) and 
P(HIS4) activators (Fig. 2B and tables S6 and 
S7), in agreement with a direct comparison of 
P(HIS4) and P(PGK1) effects during CRISPRi 
perturbation (Fig. 2A and fig. S3, A and B). 
Although guides that activated P(HIS4) did 
not induce P(PGK1), knockdown of general 
RNA polymerase II transcription reduced 
expression in both reporters (fig. S3, A to E). 
Additionally, analysis of P(PGK1) in isolation 
demonstrated a clear induction in response 
to knockdown of glycolytic enzymes (fig. S3, 
A to C), which suggests a homeostatic trans- 
criptional activation of PGK1 expression in 
response to impaired glycolysis. Although 
the P(PGK1) promoter is often used with the 
intent to produce constitutive expression, it is 
subject to regulation (25), and Pgk1 activity 
increases when glycolysis is inhibited (26). 
Our P(HIS4) CiBER-seq profile with DNA nor- 
malization confirmed observations from a pre- 
vious profile based on P(PGK1) normalization: 
namely, that well-characterized defects in tRNA 
charging and newly identified deficiencies in 
tRNA synthesis affected P(HIS4) specifically 
(Fig. 2C). We thus surveyed the molecular com- 
plexes whose knockdown activated P(HIS4) 
(Fig. 2D). We identified 35 different gRNAs 
targeting amino acid biosynthesis pathways, 
along with guides against 17 of the 20 
aminoacyl-tRNA synthetases (Fig. 2, D and E), 
which are all expected to interfere with tRNA 
charging. We also observed P(HIS4) activation 
in response to knockdown of each individual 
component of the eIF2 translation initiation 
complex (Fig. 2, D and E); depletion of these 
proteins directly increases translation of the 
Gcn4 transcription factor, leading to P(HIS4) 
induction (24, 27). 


Fig. 2. CiBER-seq recapitulated known genetic regulators of ISR and identified regulators related to 
tRNA insufficiency. (A) Schematic of CiBER-seq profiling experiment, with modifications to isolate the 
regulatory effects of guide-mediated knockdown on a single promoter. (B) GO analysis of P(PGK1) and 
P(HIS4) with DNA normalization, as in Fig. 1F. (C) Comparison of CiBER-seq profiles for P(PGK1)-normalized 
and DNA-normalized P(HIS4) CiBER-seq analysis. Guides with significant effects in either profile are shown, and 
all GO terms highlighted in (B) are condensed to one group. (D) Genome-wide CiBER-seq profile of P(HIS4) 
transcription relative to DNA barcode abundance. Each point represents a different gRNA, analyzed to determine 
the change in P(HIS4) barcode RNA levels upon gRNA induction, normalized against the change in barcode DNA. The 
lines indicate cutoffs for significant (q < 0.05) and substantial (>1.5-fold change) effects. Guides are color coded 
by relevant statistically overrepresented GO terms with strong effects on P(HIS4). (E) Schematic of biological 
complexes with significant guides [as in (D)], with colors corresponding to GO terms. Subunits without a significant 
guide are displayed in gray. Pol III, polymerase III; TFIII, transcription factor for Pol III. 


Fig. 3. tRNA insufficiency triggered HIS4 transcription independently of 
eIF2α phosphorylation and Gcn2 kinase. (A) Comparison of P(HIS4) 
CiBER-seq profiles before and after 3AT treatment, analyzed and colored as in 
Fig. 2D. (B) Schematic outlining expected ISR responses for CRISPRi 
knockdown of genes in various functional categories, before and after 3AT 
treatment. (C) Western blot for eIF2α phosphorylation relative to hexokinase 
(Hxk2) loading control. Knockdown of amino acid biosynthesis (ILV2) and 
aminoacyl tRNA charging (HTS1) increases eIF2α phosphorylation, whereas 
knockdown of RNA polymerase III (RPC31) or tRNA processing (POP6 and 
SEN15) does not. (D) Quantification of eIF2α phosphorylation relative to 
hexokinase loading control across N = 3 biological replicates. (E) Change in 
endogenous HIS4 mRNA levels after guide induction measured by qPCR. 
Endogenous HIS4 activation by ILV2, HTS1, RPC31, POP6, and SEN15 
knockdown is completely GCN4 dependent. HIS4 activation by ILV2 
knockdown is also completely GCN2 dependent, whereas effects of RPC31 
and POP6 knockdown are entirely GCN2 independent, and HTS1 and SEN15 
show intermediate dependency. Error bars represent standard deviation for 
N = 3 biological replicates. (F) As in (E) for guides targeting three distinct 
RNA polymerase III subunits. Error bars represent standard deviation for 
N = 3 biological replicates. 

We also found P(HIS4) activation in re- 
sponse to guides targeting many steps of tRNA 
biogenesis, which suggests that overall tRNA 
depletion triggered the ISR transcriptional 
program. Guides targeting subunits of RNA 
polymerase III, which transcribes tRNAs, as well 
as the tRNA processing complex ribonuclease 
(RNAse) P and the tRNA splicing endonuclease 
(SEN) complex (28) all activated P(HIS4) trans- 
cription (Fig. 2, A and C). In contrast to known 
ISR triggers, these genetic perturbations should 
not lead to the accumulation of uncharged 
tRNAs but should reduce overall tRNA levels. 
Although depletion of initiator methionyl-tRNA 
can induce P(HIS4) (29) and RNA polymerase 
III defects can lead to initiator methionyl-tRNA 
depletion (30), this effect could not explain our 
observations here. Initiator tRNA does not con- 
tain an intron, and so the loss of SEN should 
not reduce initiator tRNA levels. To further ex- 
clude initiator tRNA depletion as an explanation 
for the effects of RNA polymerase III knock- 
down, we overexpressed initiator tRNA during 
CRISPRi-mediated ISR activation (fig. S3, J and 
K). Impaired tRNA transcription showed no 
particular susceptibility to suppression of the 
ISR response by high-copy overexpression of 
initiator tRNA, relative to disruption of amino 
acid biosynthesis or tRNA charging (fig. S3K). 
Thus, elongator tRNA depletion can directly 
activate ISR transcription, perhaps through its 
effects on translation elongation. 


tRNA insufficiency triggered HIS4 transcription 
independently of eIF2α phosphorylation and 
Gcn2 kinase 


Fig. 4. Perturbations of the ARP2/3 complex prevented ISR activation by
HTS1 or RPC31 knockdown. (A to C) Pair-wise comparison of P(HIS4) CiBER-seq
profiles between ISR activation by 3AT treatment, HTS1 knockdown, and RPC31
knockdown. CiBER-seq profiles represent changes in the guide effect on
P(HIS4) expression in the context of an ISR activator (as in Fig. 3A) relative to


Because the ISR is not activated during log-
phase growth, our initial CiBER-seq data did
not identify expression changes caused by loss
of canonical, positive transducers of ISR signal- 
ing, such as GCN2 and GCN4. To uncover the 
phenotypes of these and other ISR pathway 
genes, we next looked for regulators whose 
depletion would block P(HIS4) activation 
triggered by the toxic histidine analog 3-amino-
1,2,4-triazole (3AT), which blocks histidine bio-
synthesis (Fig. 3, A and B). Guides that activated
P(HIS4) under replete nutrient conditions did 
not further elevate P(HIS4) expression upon 
3AT treatment (Fig. 3A). We inferred that 
Gcn4-mediated activation is saturated under 
these conditions; however, knockdown of Gcn4
degradation factors, such as PCL5 (31), can en-
hance 3AT-mediated P(HIS4) induction. Mean-
while, guides that reduced P(HIS4) transcription,
including guides targeting the core RNA poly-
merase II transcription machinery (fig. S2, A
to C), did not interfere with P(HIS4) activation
upon 3AT treatment. We therefore reasoned 
that guides affecting amino acid biosynthesis, 
as well as tRNA charging, transcription, and 
processing, each saturated P(HIS4) tran- 
scription through a shared, GCN4-dependent 
mechanism.

Because a hallmark of ISR activation is
the phosphorylation of eIF2α, we investigated
whether tRNA insufficiency provoked this
response. Although CRISPRi knockdown of
the amino acid biosynthetic enzyme ILV2 or
the histidyl-tRNA synthetase HTS1 both
induced eIF2α phosphorylation, knockdown
of the RNA polymerase III subunit RPC31 or
tRNA processing and maturation factors
POP6 and SEN15 did not (Fig. 3, C and D,
and fig. S4A), consistent with previous obser-
vations (30). Deletion of GCN2, the only known
eIF2α kinase in yeast (32), completely blocked
eIF2α phosphorylation in all conditions tested
(fig. S4B). Activation of P(HIS4) in the absence
of eIF2α phosphorylation further distinguished
the response to tRNA depletion upon RPC31 
knockdown and the classical ISR pathway. 

We thus tested the genetic requirements 
for activation of endogenous HIS4 transcription 
in response to these CRISPRi-mediated pertur- 
bations. Deletion of GCN4 blocked HIS4 activa- 
tion by RPC31, POP6, and SEN15 knockdown 
as well as ILV2 and HTS1 knockdown, and so 
the effects of tRNA insufficiency and defects in 
tRNA maturation reflected a GCN4-mediated ISR.

the change caused by the guide in isolation (as in Fig. 2D). Plotted guides were
significant (q < 0.05) in at least one of the three epistatic profiles. All guides,
regardless of significance, were used to calculate the pairwise Pearson correlations.
(D) Coverage of actin cytoskeleton and ARP2/3 complex by guides that block
ISR induction during HTS1 or RPC31 knockdown but not 3AT treatment.


Furthermore, GCN2 deletion produced distinct 
effects across these five CRISPRi knockdowns 
(Fig. 3E). Deletion of GCN2 completely blocked 
HIS4 induction in response to ILV2 depletion,
consistent with canonical models of ISR signal-
ing (33). Although HTS1 knockdown triggered
strong eIF2α phosphorylation, GCN2 deletion
only partially abrogated its transcriptional ef-
fects; SEN15, which did not induce eIF2α
phosphorylation, nonetheless showed a weakened
response in the deletion as well. Finally, RPC31
and POP6 knockdowns induced HIS4 trans-
cription independent of GCN2, as we expected
on the basis of the lack of eIF2α phosphory-
lation and P(HIS4) CiBER-seq analysis in a
gcn2 null background (figs. S2D and S4C).
Epistatic characterization of additional RNA
polymerase III subunits RPC17 and RPO31
confirmed that tRNA insufficiency broadly
activated P(HIS4) by means of a GCN2-
independent mechanism and that this was
not an RPC31-specific effect (Fig. 3F). Thus, trans-
lation elongation defects arising from impaired
tRNA recruitment directly triggered GCN4-
mediated transcriptional responses. Knockdown
of tRNA synthetases activated the ISR by this
pathway in parallel with the GCN2-dependent
response to uncharged tRNAs. 


Perturbations of the ARP2/3 complex prevented 
ISR activation by HTS1 or RPC31 knockdown 


Given the fact that both Gcn2 activity and 
eIF2α phosphorylation were dispensable for
ISR activation in response to tRNA depletion, 
we next sought to systematically identify genes 
required for this response. We looked for genetic 
perturbations that modified P(HIS4) responses 
to different ISR triggers by performing dual- 
guide CiBER-seq—combining one guide that 
individually activated the ISR with a second 
guide from our genome-wide CRISPRi library 
to obtain quantitative, genome-wide genetic 
interaction profiles. We compared the inter- 
action profiles of the histidyl-tRNA synthetase 
HTS1, which is required for tRNA charging,
with those of the RNA polymerase III subunit 
RPC31, which is required for tRNA transcrip- 
tion (Fig. 4, A to C, and tables S8 and S9). We 
observed saturated P(HIS4) induction after
knockdown of HTS1 or RPC31, similar to the
P(HIS4) saturation that we observed upon 
3AT treatment (Fig. 3, A and B). Guides that 
activated the ISR on their own did not further 
enhance transcription in these dual-guide epis-
tasis experiments. We also observed guides that
had no effect on their own but either suppressed 
ISR activation, such as guides against GCN4 
itself, or enhanced the strength of the response, 
including guides against the degradation factor 
PCL5 or the ISR inhibitor YIH1 (34, 35).
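
As a rough illustration of how a dual-guide interaction score of this kind can be read, the sketch below expresses the effect of a library guide in the presence of an ISR-activating guide relative to its effect in isolation, on a log2 scale. The function names and reporter values are hypothetical, and the statistical model actually used to fit CiBER-seq barcode counts is not reproduced here.

```python
import math


def log2_fold_change(perturbed, control):
    """log2 ratio of reporter expression (perturbed vs. control)."""
    if perturbed <= 0 or control <= 0:
        raise ValueError("expression values must be positive")
    return math.log2(perturbed / control)


def epistasis(guide_with_activator, activator_alone, guide_alone, baseline):
    """Guide effect in the activator context minus guide effect in isolation."""
    effect_in_context = log2_fold_change(guide_with_activator, activator_alone)
    effect_alone = log2_fold_change(guide_alone, baseline)
    return effect_in_context - effect_alone


# Hypothetical reporter levels (arbitrary units), for illustration only.
baseline = 100.0          # no perturbation
activator_alone = 800.0   # ISR-activating guide only

# A guide with no effect alone that lowers the activated signal (suppressor-like).
suppressor = epistasis(200.0, activator_alone, 100.0, baseline)

# A guide with no effect alone that raises the activated signal (enhancer-like).
enhancer = epistasis(1600.0, activator_alone, 100.0, baseline)

print(suppressor, enhancer)
```

Under these assumed values, a negative score corresponds to a guide that suppresses ISR activation and a positive score to one that enhances it.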

We compared the genetic interaction pro- 
files across three different ISR stimuli to test 
whether they were acting through similar or 
distinct pathways. Epistatic profiles of HTS1
and RPC31 knockdown resembled each other
(R² = 0.76, where R² is the coefficient of
determination) more closely than either of
them resembled the profile during 3AT treat-
ment (R² = 0.48), although all three profiles
did overlap substantially. This pattern aligned
with our observation that P(HIS4) activation
by either tRNA charging defects or tRNA
depletion was not completely GCN2 dependent,
whereas the amino acid starvation response was 
fully GCN2 dependent (Fig. 3E). Thus, we looked 
for epistatic modifiers of P(HIS4) activation in 
response to HTS1 and RPC31 knockdown. Using 
unbiased GO enrichment analysis on the subset 
of observed guides that blocked ISR-mediated 
transcriptional activation (tables S8 and S9), we 
identified an unexpected requirement for the 
actin cytoskeletal components in this response
(Fig. 4). Although knockdown of actin itself
(ACT1) or genes encoding members of the
Arp2/3 complex (fold enrichment >14, q <
0.002 in both dual-guide datasets) did not af-
fect P(HIS4) transcription in normal growth
or 3AT treatment (fig. S5A), it blocked P(HIS4) 
activation by tRNA charging defects or tRNA 
depletion (fig. S5, B and C). This could reflect 
interactions between the ISR inhibitor YIH1 and 
free actin monomers (35), or it could arise be- 
cause of nuclear actin’s role in transcription (36). 
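
The pairwise comparison of epistatic profiles reported above (Pearson correlation over all guides, summarized as R²) can be sketched as follows. This is a minimal illustration using only the Python standard library; the three short profiles are invented numbers, not data from this study.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equal-length guide-effect profiles."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant profile has no defined correlation")
    return sxy / math.sqrt(sxx * syy)


def r_squared(x, y):
    """Coefficient of determination for a simple pairwise comparison."""
    return pearson_r(x, y) ** 2


# Hypothetical epistasis scores for the same five guides in three screens.
hts1 = [-2.0, -1.5, 0.1, 0.8, -0.3]
rpc31 = [-1.8, -1.6, 0.0, 0.9, -0.2]
three_at = [-2.1, 0.2, 0.1, 0.7, -1.0]

print(r_squared(hts1, rpc31))     # the two tRNA-related profiles
print(r_squared(hts1, three_at))  # tRNA charging vs. 3AT treatment
```

In this toy example the two tRNA-related profiles correlate more strongly with each other than either does with the 3AT profile, which is the qualitative pattern described in the text.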


Disrupting sumoylation enhanced the activity 
of Gcn4 


CiBER-seq analysis of P(HIS4) regulation
provided a comprehensive view of yeast ISR 
signaling, which revealed distinct triggers 
for GCN4-dependent transcriptional responses. 
However, this approach did not allow us to 
distinguish the different layers of Gcn4 regu-
lation. Gcn4 abundance is controlled by regu-
lated protein degradation (37) in addition to
its well-characterized translational induction
upon eIF2α phosphorylation (24). We observed
epistatic enhancement of ISR activation from 
guides depleting the decay factor PCL5. Thus, 
we wanted to specifically analyze the regulators 
of Gcn4 stability and translation in isolation 
and separate them from other effects on P(HIS4)
activity. To couple these protein-level pheno-
types with a transcriptional readout suitable for
CiBER-seq, we returned to the synthetic, ZEM
chimeric transcription factor used to initially
validate barcode sequencing. Because barcode
transcription was directly linked to the abun-
dance of this synthetic transcription factor (fig.
S1, E and I), we reasoned that it could couple
protein-level regulation with expressed RNA 
barcode abundance (Fig. 5A). 

We first profiled posttranslational control of 
Gcn4 by carrying out CiBER-seq in yeast that 
constitutively expressed a protein fusion be-
tween Gcn4 and the chimeric ZEM transcrip-
tion factor (Fig. 5A and tables S10 and S11).
Levels of this transcription factor should re-
flect only posttranslational regulation of Gcn4—
and not its translational regulation—because
it is expressed using the PGK promoter and 5’
untranslated region (5’UTR). Knockdown of
PCL5, which regulates Gcn4 degradation, af-
fected the activity of the Gcn4-ZEM post-
translational reporter. However, neither tRNA 
charging nor tRNA biogenesis had an effect. 
Instead, disruption of sumoylation increased 
expression from P(Z) (Fig. 5, B and C). We 
identified individually significant gRNAs tar- 
geting nearly every step of the sumoylation 
pathway, including SUMO itself, SUMO- 
activating and SUMO-conjugating enzymes, 
and the Siz1 SUMO E3 ligase (38). Likewise, 
we identified guides against most compo- 
nents of the Nup84 subcomplex of the nuclear 
pore (39) that increased Gcn4-ZEM activity 
(Fig. 5B).

Fig. 6. The GCN4 5' leader sequence is an intrinsic biosensor of translation
stress. (A) Schematic of indirect CiBER-seq experiment to identify mediators
of GCN4 5’ leader translation regulation. (B) Comparison of P(HIS4) DNA-normalized
CiBER-seq profile (Fig. 2D) versus indirect P(Z) profile generated with P(GCN4)-
UTR(GCN4)-ZEM, with ISR activators colored according to their function, as in Fig. 2D.
(C) Model of GCN4 5' leader sequence as an intrinsic biosensor of translation
stress. Either tRNA insufficiency or uncharged tRNAs increase GCN4 translation.
(D) GO analysis of indirect CiBER-seq experiments, as in Fig. 1F. Significant anno-
tations were collected from both the GO biological process complete and
reactome pathway annotation datasets. ORF, open reading frame. 


Sumoylation of Gcn4 promotes its eviction
from chromatin—even when it is recruited 
through heterologous DNA binding domains— 
and causes its subsequent degradation. Remov- 
al of Gcn4-ZEM from target promoters and 
degradation of the protein are both expected 
to limit transactivation of barcoded reporters 
(40, 41), and knockdown of the SUMO con- 
jugation machinery would relieve these limit- 
ing effects and enhance barcode expression. 
Elevated barcode expression during knockdown 
of the Ulp1 deconjugating protease could result 
from its role in SUMO maturation (42) or reflect
a more-complex requirement for a sumoylation-
desumoylation cycle in Gcn4 regulation. Our
Gcn4-ZEM fusion did retain the Gcn4 DNA 
binding domain, in addition to the heterologous 
Zif268 DNA binding domain, which could also 
contribute to the overall profile of responses 
that we observed. 

The sites of SUMO modification on Gcn4
are known, and unSUMOylatable mutants
have been reported. To directly test the role of
SUMOylation in regulation of Gcn4-mediated
transcription, we generated a transcription
factor fusion with the unSUMOylatable Gcn4
mutation (40) and measured its activity (Fig.
5D). The unSUMOylatable mutant caused 
higher baseline P(Z) expression that was not 
further increased upon SUMO cascade knock-
down (Fig. 5D). Thus, Gcn4-specific SUMOylation
is reducing the activity of the Gcn4-ZEM fusion, 
and this reduction is relieved when SUMOylation 
is impaired globally or blocked by Gcn4-specific 
mutations. The role of the nuclear pore, although 
less clear, may be linked to sumoylation. In yeast, 
Ulp1 binds physically to the nuclear pore (43, 44), 
mutations in the Nup84 complex can mimic 
some ulp1 phenotypes (45, 46), and Ulp1 at the
nuclear pore has been implicated directly in
transcriptional activation (47). Alternatively,
this effect may be specific to the Gcn4-ZEM
fusion, although it did not arise in other CiBER- 
seq profiles based on the ZEM transcription 
factor, as described below. 


The GCN4 5’ leader sequence is an intrinsic 
biosensor of translation stress 


We observed clear and coherent patterns of 
genetic perturbation affecting Gcn4 protein 
activity, but we saw no evidence that tRNA 
insufficiency affected its posttranslational re- 
gulation. We next explored the translational 
control of GCN4, which results from regulatory 
upstream open reading frames in the GCN4 5’ 
leader sequence (24). To capture perturbations 
that affect this translation regulation, we drove 
expression of the ZEM synthetic transcription 
factor from the GCN4 promoter and 5’ leader 
sequence and assessed guide effects on barcode 
expression (Fig. 6A). 

The same guides that activated P(HIS4), in-
cluding those that block tRNA biogenesis as
well as tRNA charging, also increased P(Z) in 
our CiBER-seq analysis of translational control 
through the GCN4 5’ leader (>32-fold, hyper- 
geometric P < 2.04 x 107°) (Fig. 6B, fig. S6A, 
and tables S12 and S13). By assessing this trans- 
lational response in isolation, we observed that 
effects occur in the same direction but with 
larger magnitude than P(HIS4). Thus, trans-
lational regulation is central to ISR activa-
tion, even when it proceeds independently of 
GCN2 and eIF2a phosphorylation (Fig. 6C). To 
exclude transcriptional effects mediated by 
the GCN4 promoter (48), we repeated this 
CiBER-seq profiling experiment with a version
of the ZEM transcription factor driven from
the P(PGK1) promoter, but still containing the
GCN4 5’ leader (fig. S6B). This profile enriched 
most of the same functional categories as the 
profile using the HIS4 promoter, including tRNA 
transcription and processing (fig. S6C). We ob- 
served an overall enrichment of shared activators 
between the two profiles (>11-fold, hypergeo- 
metric P < 1.62 x 10 ”) and specific enrichment 
in tRNA biogenesis factors (>100-fold, hyper- 
geometric P < 8.88 x 10°”). 
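
The overlap statistics quoted above (fold enrichment and hypergeometric P value for activators shared between two profiles) can be illustrated with a short sketch. It assumes a one-sided, upper-tail hypergeometric test, and the counts in the example are hypothetical; the guide counts behind the reported values are not given in this section.

```python
from math import comb


def hypergeom_upper_tail(k, N, K, n):
    """P(X >= k) when n guides are drawn from N, of which K are 'hits'."""
    if not (0 <= K <= N and 0 <= n <= N):
        raise ValueError("require 0 <= K <= N and 0 <= n <= N")
    total = comb(N, n)
    upper = min(K, n)
    favorable = sum(
        comb(K, i) * comb(N - K, n - i) for i in range(max(k, 0), upper + 1)
    )
    return favorable / total


def fold_enrichment(k, N, K, n):
    """Observed overlap fraction divided by the fraction expected by chance."""
    return (k / n) / (K / N)


# Hypothetical counts: N guides tested, K activators in profile 1,
# n activators in profile 2, k activators shared by both profiles.
N, K, n, k = 10, 4, 3, 2
print(fold_enrichment(k, N, K, n))
print(hypergeom_upper_tail(k, N, K, n))
```

With genome-scale libraries the same functions apply unchanged, because `math.comb` works on arbitrarily large integers.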


Discussion 


By linking CRISPRi guides with barcoded 
expression reporters, we generated quan- 
titative, genome-wide phenotypic profiles for 
specific molecular events in the yeast cell. 
Our CiBER-seq approach allowed us to ad- 
dress transcriptional, translational, and post- 
translational regulation, enabling the systematic 
genetic analysis of diverse biological processes. 
We characterized distinctive CiBER-seq pro- 
files for five different promoters, each of which 
could be understood in light of the function of 
the associated gene. We leveraged these quan-
titative phenotypic profiles to gain insights
into the ISR, identifying tRNA depletion as
a previously unappreciated trigger for this 
well-characterized pathway. 

Because translation is a resource-intensive 
biosynthetic process, most organisms sense 
translational stresses and respond with phys- 
iological changes that maintain homeostasis. 
In eukaryotes, amino acid starvation, which 
directly affects tRNA charging and translation 
elongation, triggers a global decrease in trans- 
lation initiation while also increasing the trans- 
cription of amino acid biosynthesis genes (24). 
In many cases, reduced initiation is sufficient 
to restore normal elongation profiles (49). In 
fact, when aminoacyl-tRNA synthetases are 
depleted, yeast reduces translation initiation 
until tRNA charging and utilization are balanced. 
Furthermore, in normal circumstances, these 
synthetases appear to buffer tRNA levels by 
sequestering uncharged tRNAs (50). Here, we 
found that the opposite situation—tRNA deple- 
tion in the presence of adequate synthetases 
and amino acids—triggered a GCN2-independent 
ISR. Our comprehensive genetic data, implicating 
many stages of tRNA biogenesis, suggest that 
GCN4 translation responds directly to elonga- 
tor tRNA insufficiency, rather than to depletion 
of initiator tRNA (30) or the accumulation of 
unprocessed nuclear tRNA precursors (51). Be-
cause we have not identified other trans-acting
regulatory pathways under these circumstan- 
ces, we propose that the GCN4 5’ leader is an 
intrinsic biosensor for translation elongation 
stress (Fig. 6C). Artificial stimuli, including tRNA 
depletion, can activate GCN4 translation in- 
dependent of eIF2a phosphorylation. In the 
natural history of budding yeast, the ISR 
likely evolved to sense nitrogen starvation 
and elicit general inhibition of translation 
through GCN2, along with GCN4-dependent 
activation of homeostatic transcriptional 
programs (24). Our work expands the range of 
stresses known to activate the ISR and points 
to a general mechanism for sensing transla- 
tional perturbations. 

Although we showed that the ISR is ac- 
tivated by artificial, genetic depletion of tRNAs, 
similar situations may arise in a natural context 
(52). Starvation and other stresses can induce 
tRNA cleavage (53, 54), and although much 
attention has focused on the positive roles of 
the resulting fragments (55), cleavage can also 
deplete the tRNA substrate and alter transla- 
tion (56). Effective tRNA depletion may also 
arise when tRNAs are sequestered in the nucleus 
and thus unavailable for cytosolic translation 
(28). Finally, an array of human disease muta- 
tions that affect tRNA biogenesis factors lead 
to neurodegenerative disorders, although the 
molecular basis for this effect is not clear 
(52, 57). Mutations in tRNA genes themselves 
can lead to tRNA insufficiency, ISR activation, 
and neurodegeneration, although in mammals 
this effect is GCN2 dependent (58, 59).

Our results demonstrate the power of
CiBER-seq in elucidating the genetic archi- 
tecture of regulation in the cell. We observed 
distinct patterns of response for different 
promoters and comprehensive coverage of 
pathways and molecular complexes by guides 
targeting each individual component, which 
indicate that our approach is both specific and 
sensitive. We identified ISR activation upon 
knockdown of each subunit of RNA polymer- 
ase III along with most proteins involved in 
tRNA processing, which reveals tRNA deple- 
tion as the underlying trigger for the ISR. We 
identified other modes of regulation as well, 
such as sumoylation of Gcn4, and even detected 
the compensatory P(PGK1) activation in re-
sponse to impaired glycolysis. Quantitative
CiBER-seq profiles enable genetic interaction 
analysis that provides further insight into regu- 
latory networks. Correlated patterns of epista- 
sis are a powerful tool for identifying genes that 
function together in pathways and complexes, 
and likewise chemogenomic comparison be- 
tween chemical and genetic interaction profiles 
can reveal functional drug targets (17). Here, we
leveraged this advantage of CiBER-seq to iden- 
tify a role for actin cytoskeletal components in 
ISR signaling. More broadly, our results high- 
light the power of CiBER-seq to combine specific 
and quantitative molecular phenotypes with 
targeted genetic perturbations and thereby 
precisely dissect regulatory pathways. A similar 
Reporter-seq technique has been developed
simultaneously with this work, and it has been 
applied to elucidate known and unknown 
pathways linking diverse stressors to the yeast 
heat shock response (60). Because the key com-
ponents of CiBER-seq translate into nearly any
organism, we anticipate many biological in- 
sights will arise from broad application of our 
approach. 


Materials and methods 
Plasmid materials 


pRS416-dCas9-Mxi1 + TetR + pRPR1(TetO)-
NotI-gRNA was a gift from R. Davis (Addgene
plasmid #73796; http://addgene.org/73796; RRID: 
Addgene_73796). pKT0139 was a gift from K. 
Thorn (Addgene plasmid #8731;
http://addgene.org/8731; RRID: Addgene_8731). pHES836 was
a gift from H. El-Samad (Addgene plasmid 
#89195; http://addgene.org/89195; RRID: 
Addgene_89195). pHES795 was a gift from H. 
El-Samad (Addgene plasmid #87943;
http://addgene.org/87943; RRID: Addgene_87943).
pCfB2337 was a gift from I. Borodina (Addgene
plasmid #67555; http://addgene.org/67555;
RRID: Addgene_67555). pCfB2226 was a gift 
from I. Borodina (Addgene plasmid #67533; 
http://addgene.org/67533; RRID: Addgene_67533). 
pCfB2189 was a gift from I. Borodina (Addgene 
plasmid #67532; http://addgene.org/67532; 
RRID: Addgene_67532). Plasmids used in this 
study are listed in table S1.


Plasmid construction


Oligonucleotides used in this study are listed 
in table S2. Plasmid assembly was carried out 
using standard molecular biology techniques 
as described below, and verified by Sanger 
sequencing. All polymerase chain reaction 
(PCR) reactions were performed using Q5 
polymerase (NEB M0491S) according to man- 
ufacturer protocols. Restriction enzymes were 
obtained from NEB and high-fidelity (HF)
variants were used when available. DNA frag- 
ments were size-selected by gel electrophore- 
sis in 1% agarose. Gibson assembly reactions 
were carried out using NEBuilder HiFi DNA 
Assembly Master Mix (NEB E2621L) with 
DNA fragments containing homology arms 
between 15 and 20 base pairs (bp) long. DNA 
was purified and concentrated as necessary 
with DNA Clean & Concentrator (Zymo D4013). 
Transformations were performed in Stbl3 
chemically competent cells provided by the 
QB3 Berkeley Macrolab facility and plated on 
appropriate antibiotic plates for colony selec- 
tion. Plasmid DNA was purified from liquid 
cultures using DNA miniprep kit (NEB #T1010).

pNTI660 was constructed in several steps.
First, pNTI601 (pRS416-dCas9-Mxi1 + TetR +
pRPR1(TetO)-NotI-gRNA, AddGene #73796)
(17) was amplified with NM717 and NM724 (16)
to isolate the gRNA expression cassette and the
resulting PCR product was recircularized by
Gibson assembly to create pNTI646. pNTI646
was then digested with MfeI and combined
with oligonucleotide NI1025 to create pNTI660.

pNTI725 was created in several steps. First,
the promoter P(PGK1) was amplified from
Saccharomyces cerevisiae BY4741 genomic
DNA using NI553 and NI554, and the result-
ing product was subcloned and used as a tem-
plate for P(PGK1) amplification. Yeast-optimized
yECitrine coding sequence was amplified from
pNTI189 (pKT0139 AddGene #8731) (61) using
primers RM151 and RM155 and cloned down-
stream of P(PGK1). Next, the P(PGK1)-yECitrine
expression fragment was isolated by diges-
tion at the SacI and BsrGI restriction sites, and
pNTI660 was linearized by digestion with MfeI.
Double-stranded DNA splints were created by 
annealing oligonucleotides RM317 and RM318 
and oligonucleotides RM319 and RM320. 
Digested P(PGK1)-yECitrine and pNTI660 were 
joined together using these splints in a four- 
piece Gibson assembly reaction. The NotI site 
for gRNA insertion was replaced with an NruI
site by digesting the backbone with NotI and 
then introducing single-stranded oligonucleotide 
RM321 by Gibson assembly. Finally, the Illumina 
TruSeq Read1 site was added to the beginning
of the terminator T(ADH1) by annealing single-
stranded oligonucleotides RM323 and RM324, 
extending them using Q5 polymerase, digesting 
the backbone with AscI and introducing the 
double-stranded product into the digested 
plasmid by Gibson assembly.

pNTI726 was constructed by replacing
P(PGK1) in pNTI725 with estradiol-responsive
P(GAL1). P(GAL1) with an embedded Zif268
binding site was PCR amplified from pHES836
using primers RM348 and RM349. pNTI725
was digested with SacI and EcoRI and amplified
P(GAL1) was then Gibson assembled into the
backbone. 

pNTI727 was generated by inserting the
gRNA sequence targeting P(Z) into pNTI726.
Plasmid pNTI726 was digested with NruI,
single-stranded oligonucleotides RM389 and 
RM390 were annealed, extended using Q5 DNA 
polymerase, and the double-stranded product 
was introduced into the linearized plasmid by 
Gibson assembly. 

pNTI728 was generated by inserting the 
gRNA sequence targeting P(ADH1) into pNTI726,
using the same methods as pNTI727, using 
oligonucleotides RM383 and RM384. 

pNTI729 was constructed by first amplifying 
the ZEM transcription factor from pNTI638 
(pHES795 Addgene #87943) (62) in two pieces, 
using oligonucleotide primers RM352 with 
RM353 and RM354 with RM355. The EasyClone
vector pCfB2337 (Addgene #67555) (63) was then 
digested with HindIII and the three fragments 
were combined by Gibson assembly. 

pNTI730 was constructed by replacing the
promoter P(ADH1) in pNTI729 with the
P(PGK1) promoter. pNTI729 was digested with
PvuII and NheI, P(PGK1) was amplified from
pNTI725 using primers RM417 and RM418, 
and the two fragments were combined by 
Gibson assembly. 

pNTI731 through pNTI737, expressing gRNAs
against ILV2, HTS1, RPC31, NUP133, NUP145,
ULP1, and UBC9, respectively, were subcloned
into the integrating plasmid pCfB2189. First, 
DNA fragments containing the gRNA expres- 
sion cassette were amplified from the pool of 
gRNA plasmids using forward primers RM639, 
RM517, RM518, and RM641 through 644, re- 
spectively, that anneal to the specific nucleotide 
barcode sequence, and a reverse primer RM519 
that binds downstream of the gRNA scaffold. 
Fragments were amplified with Gibson homol- 
ogy arms on each end. The integration vector 
pCfB2189 (Addgene #67532) (63) was digested 
with HindIII and gRNA-expressing fragments 
were transferred into this backbone by Gibson 
assembly. 

pNTI738 for initiator methionyl-tRNA over-
expression was constructed by amplifying the
initiator methionyl-tRNA locus IMT4 from 
yeast genomic DNA using primers RM636 and 
RM637, digesting the high-copy yeast plasmid 
pRS426 (ATCC #77107) (64) with EcoRI, and 
combining them by Gibson assembly. 

pNTI739 was constructed from pNTI730.
The GCN4 coding sequence (CDS) was ampli- 
fied from yeast genomic DNA using primers 
RM515 and RM516 and inserted upstream of, 
and in frame with, the synthetic transcription
factor by digesting pNTI730 with NheI and
introducing GCN4 by Gibson assembly.

pNTI740 was constructed by digesting pNTI730
with PvuII and NheI, amplifying the GCN4
promoter and 5’UTR from yeast genomic DNA 
using primers RM513 and RM514, and com- 
bining the two fragments by Gibson assembly. 

pNTI741, the P(HIS4)-yECitrine PCR tem- 
plate used for downstream CiBER-seq plasmid 
library preparation, was generated by replac-
ing the P(PGK1) in pNTI725 with P(HIS4).
pNTI725 was digested with SacI and NheI,
P(HIS4) was amplified from yeast genomic 
DNA using primers RM499 and RM500, and 
the two fragments were combined by Gibson 
assembly. 

pNTI742 was made in several steps. First, the 
NotI restriction site of pNTI660 was replaced 
with an AvrII restriction site by digesting with
NotI and introducing a replacement cassette, 
composed of annealed oligonucleotides RM459 
and RM460, by Gibson assembly. Next, the 
vector was digested with SacI and MfeI. The
T(ADH1) terminator and TruSeq Read1 priming
site were amplified from pNTI726 using primers 
RM489, RM490, and RM491, in a nested PCR 
approach where RM490 was used at 0.1 times 
the normal concentration. The resulting PCR 
product was joined with the digested vector by 
Gibson assembly. 

pNTI743 was made in several steps. First, 
the yeast ACT1 3'UTR and transcription ter-
mination sequence was PCR amplified from
yeast genomic DNA using RM633 and RM634. 
pNTI742 was linearized with BamHI and the 
amplified PCR product was inserted by Gibson 
assembly. This intermediate vector was then 
digested with HindIII and AflII, and oligo
RM719 was inserted by Gibson assembly.

pNTI744 through pNTI750, expressing gRNAs
against POP6, SEN15, RPC17, RPO31, MLC1,
CDC10, and CHS2, respectively, were subcloned
into the integrating plasmid pCfB2189. First, 
DNA fragments containing the gRNA expres- 
sion cassette were amplified from the pool of 
gRNA plasmids using forward primers RM722- 
725, RM813, RM814, and RM817, respectively, 
that anneal to the specific nucleotide barcode 
sequence, and a reverse primer RM519 that 
binds downstream of the gRNA scaffold. Frag- 
ments were amplified with Gibson homology 
arms on each end. The integration vector 
pCfB2189 (Addgene #67532) (63) was digested 
with HindIII and gRNA-expressing fragments 
were transferred into this backbone by Gibson 
assembly. 

pNTI751 was constructed by three-piece
Gibson assembly of the following DNA fragments. First,
P(PGK1) was PCR amplified from pNTI725
using primers RM726 and RM727. Second, the
GCN4 5’UTR was amplified from pNTI740 using
primers RM728 and RM729. Third, pNTI740 
was digested with PvuII and HindIII and the 
8600-bp fragment was gel extracted. 



pNTI752 was constructed by first digesting
pNTI739 with NheI and SapI, and gel extract-
ing the 10-kb fragment. The K50,58R mutation
was introduced into the GCN4 CDS by PCR 
amplifying the CDS from pNTI739 using 
primers RM730, RM731, and RM732 (RM731 was
used at 0.1 times the concentration of the other 
two). pNTI752 was then assembled by three- 
piece Gibson of the linearized backbone, PCR 
fragment, and RM733 splint oligo. 

pNTI753 was constructed in several steps.
First, P(PGK1) was subcloned into pNTI742.
This plasmid was then linearized with SpeI
and gel extracted. P(HIS4) was PCR amplified
from yeast genomic DNA with primers RM739 
and RM740 and mCherry was PCR amplified 
from a subcloned plasmid template using 
RM741 and RM742. The three resulting frag- 
ments were Gibson assembled. 

pNTI754 through pNTI756 were cloned using
a similar method, by first digesting pNTI753
with SpeI and BamHI and gel extracting the
7-kb backbone fragment. The following pro-
moters were then amplified from yeast genomic
DNA: (i) P(MET6) with RM745 and
RM746; (ii) P(CWP1) with RM747 and RM748;
and (iii) P(PHO5) with RM749 and RM750. 
Each PCR-amplified promoter was then as- 
sembled into the linearized backbone by Gibson 
assembly. 

pNTI757 through pNTI760 were all cloned
using a similar method, by first PCR amplify-
ing respective divergent promoters using the
following combinations of PCR template and pri-
mers: (i) pNTI753 with RM789 and RM790; (ii)
pNTI754 with RM793 and RM794; (iii) pNTI755
with RM795 and RM796; and (iv) pNTI756 
with RM797 and RM798. Respective PCR 
products were then TOPO cloned using the 
TOPO Blunt system (Thermo Fisher Scientific 
#451245) according to manufacturer protocol. 


Plasmid library construction 


All pooled plasmid libraries were constructed 
by Gibson-style assembly using HiFi DNA as- 
sembly master mix (NEB E2621L) and trans- 
formed using either ElectroMAX DH10B 
(ThermoFisher #18290015) or NEB 10-beta 
Competent Escherichia coli (NEB #C3019H) 
according to manufacturer protocol. All column 
purifications were performed using Zymo DNA 
Clean and Concentrator-5 (Zymo #D4013). Dilutions
of each transformation were plated to
estimate library size and ensure sufficient
library diversity. Plasmid libraries were harvested
from liquid culture using the Monarch
Plasmid DNA Miniprep Kit (NEB #T1010). 
Miniprep kit reagents were scaled to accommo- 
date one spin column for every 5 ml of culture. 
To generate estradiol-inducible barcode- 
expressed plasmid libraries (fig. S1), pNTI726 
and pNTI728 were first digested with BamHI. 
Five barcodes were generated by annealing 
primer RM396 with one of five barcode primers, 
RM391, RM392, RM393, RM394, or RM399,
and extending the annealed duplex using Q5
polymerase. Four of these barcodes were inserted
by Gibson assembly into BamHI-linearized
pNTI726 and the last barcode DNA fragment
was inserted into BamHI-linearized pNTI728. Gibson
reactions were purified and concentrated,
transformed into ElectroMAX DH10B cells,
and inoculated directly into 200 ml of lysogeny 
broth (LB)-carbenicillin media. Pooled libraries 
were harvested when batch liquid culture reached 
an optical density (OD) of 4. 

To generate gRNA plasmid libraries, 100 pg 
of CRISPRi gRNA oligonucleotide library was 
amplified by PCR using Q5 polymerase using 
primers NM636 and NM637 (6). pNTI742 
was digested with AvrII and the amplified guide 
fragments were introduced into the digested 
vector by Gibson assembly. The assembly 
product was purified and transformed into 
ElectroMAX DH10B cells by electroporation.
Electroporated cells were inoculated directly 
into 500 ml of LB-carbenicillin media and 
harvested at an OD of 2. Serial dilutions of the 
initial transformation were plated to ensure 
sufficient library diversity (>50x coverage of 
60,000 guides). Plasmid library was purified 
using NEB miniprep kit and pooled library 
was analyzed by Sanger sequencing. 
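
The coverage target stated above reduces to simple arithmetic. The following minimal Python sketch illustrates it; the function names and the example colony count, dilution factor, and plated fraction are hypothetical and are not taken from the protocol.

```python
def required_transformants(n_guides: int, coverage: float) -> int:
    """Minimum number of independent transformants for the target coverage."""
    return int(n_guides * coverage)


def estimate_library_size(colonies: int, dilution_factor: float,
                          fraction_plated: float) -> float:
    """Estimate total transformants from colonies counted on one dilution plate.

    dilution_factor: fold dilution of the plated sample (e.g. 1e4 for 1:10,000).
    fraction_plated: fraction of the whole transformation that was sampled.
    """
    return colonies * dilution_factor / fraction_plated


# 50x coverage of 60,000 guides, as stated in the text
needed = required_transformants(60_000, 50)
print(needed)  # 3000000

# Hypothetical plate: 150 colonies from a 1:10,000 dilution of 0.1% of the culture
estimated = estimate_library_size(150, 1e4, 0.001)
print(estimated >= needed)
```

A library would be considered sufficiently diverse under this sketch when the estimated number of transformants meets or exceeds the required number.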

Barcodes were then added to the gRNA 
plasmid library, targeting an average of four 
barcodes per gRNA (240,000 barcodes in total). 
First, barcode DNA fragments were generated 
by annealing oligos RM504 and RM505 and 
extending the duplex using Q5 polymerase. The 
following thermocycler conditions were used 
to avoid amplification of any specific barcode 
sequence. Initial denaturation at 98°C for 45 s, 
six cycles of annealing and extension at 68°C 
for 15 s and 72°C for 5 s, respectively, and 
finally a 12°C hold. The barcode fragments 
were digested with BamHI and MfeI to eliminate
barcode fragments that contain these
restriction sites and then purified using a DNA 
Clean and Concentrator-5. The library of gRNA- 
expressing plasmids was then digested with 
AflII and the barcode DNA fragments were 
introduced by Gibson assembly. The assembly 
reaction was purified using a DNA Clean and 
Concentrator-5 then transformed into NEB 
10-beta Competent E. coli (High Efficiency) 
according to manufacturer protocol. Transfor- 
mation dilutions were plated to estimate 
total number of transformants, and in parallel 
inoculated at several dilutions directly into 
LB-carbenicillin media. Liquid culture cor- 
responding to roughly 240,000 transformants 
was harvested at an OD of 2 and plasmid 
library was extracted using NEB miniprep 
kit. Reagents were scaled to accommodate one 
spin column for every 5 ml of culture. This 
barcoded gRNA library was paired-end se- 
quenced (see Barcode to gRNA assignment 
section) to assign barcodes with gRNAs. 
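
The BamHI and MfeI digestion step removes barcode fragments that happen to contain either recognition site, and the barcode target follows from the number of guides. A minimal in silico sketch of both ideas is given below, assuming the standard recognition sequences GGATCC (BamHI) and CAATTG (MfeI); the function names and example barcodes are hypothetical.

```python
# Recognition sequences; both are palindromic, so scanning one strand suffices.
SITES = {"BamHI": "GGATCC", "MfeI": "CAATTG"}


def contains_site(barcode: str) -> bool:
    """Return True if the barcode carries a BamHI or MfeI recognition site."""
    seq = barcode.upper()
    return any(site in seq for site in SITES.values())


def target_barcode_count(n_guides: int, barcodes_per_guide: int) -> int:
    """Total number of barcodes targeted for the library."""
    return n_guides * barcodes_per_guide


# Hypothetical random barcodes; the second one would be cut by BamHI.
barcodes = ["ACGTTGCAAGCTTACGATCA", "TTGGATCCAAGTCAGTCAGT"]
kept = [bc for bc in barcodes if not contains_site(bc)]
print(kept)

# Four barcodes per gRNA across 60,000 guides, as stated in the text
print(target_barcode_count(60_000, 4))  # 240000
```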

P(HIS4)-yECitrine was amplified from
pNTI741 using primers RM501 and RM502,
P(PGK1)-yECitrine was amplified from pNTI725 
using primers RM501 and RM503, and two 
versions of estradiol-inducible P(Z)-mCherry 
with specific nucleotide identifiers were am- 
plified with either RM522 and RM524 or RM523 
and RM524. The barcoded gRNA plasmid 
library was digested with BamHI and the four 
PCR products were introduced into the digested 
library by Gibson assembly to create the four 
CiBER-seq libraries. These libraries were then 
transformed into ElectroMAX DH10B cells by
electroporation. Electroporated cells were inocu- 
lated directly into 500 ml of LB-carbenicillin 
media and harvested at an OD of 2. Serial dilu- 
tions of the initial transformation were plated 
to ensure sufficient library diversity (>50x coverage
of 60,000 guides). Plasmid library was
purified using NEB miniprep kit and pooled 
library was analyzed by Sanger sequencing. 

CiBER-seq plasmid libraries for studying 
P(MET6), P(CWP1), P(PHO5), and P(HIS4) regulation
(Fig. 1) were constructed in a similar
fashion, but with key differences. pNTI743 con- 
taining divergent terminating sequences was 
used as the parent vector into which the gRNA 
library was inserted at the AvrII restriction 
site. Dual barcodes were next inserted into
this gRNA plasmid library by digesting the
plasmid library with AscI, annealing and extending
RM720 and RM721, and Gibson assembling.
Finally, the divergent promoters
containing the P(PGK1)-citrine normalizer and
P(query)-mCherry expression cassette were
inserted in between the two barcodes by: (i)
digesting pNTI757 through pNTI760 with BciVI
and gel extracting the divergent promoter template;
(ii) digesting the dual barcoded gRNA plasmid
library with AscI, which cuts in between the
two barcodes; and (iii) Gibson assembling each
divergent promoter into the barcoded gRNA
plasmid library and separately transforming 
and preparing these plasmid libraries as out- 
lined above. 


Construction of yeast strains 


All yeast strains were constructed by trans- 
forming PCR products or plasmids linearized 
by restriction enzyme digestion into yeast by 
standard lithium acetate transformation (65). 
Selected, clonal transformants were confirmed 
by two genotyping PCRs, using primer pairs 
that amplify across the junction on either side 
of the integration. Genotyping reactions were 
performed with OneTaq Hotstart Polymerase 
(NEB #M0481S). 

Yeast strains used in this study are listed in 
table S3. Strains with gcn2Δ or gcn4Δ genotype
were constructed by first amplifying the
hygromycin resistance cassette from pNTI729 
by PCR, with flanking sequence homologous 
to the 5’ and 3’ ends of the GCN2 or GCN4 
coding sequence, using primers RM579 through 
RM582 for gcn2Δ and primers RM583 through
RM586 for gcn4Δ. Primers RM589 and RM590
were used to genotype the upstream junction for
gcn2Δ, whereas RM591 and RM592 were used
for the downstream junction. Primers RM593
and RM590 were used to genotype the upstream
junction for gcn4Δ, whereas RM591 and RM594
were used for the downstream junction. 

gRNA expression cassettes were integrated 
using vectors from the EasyClone 2.0 toolkit 
for yeast genomic integration (63). Plasmids 
using backbone pCfB2189 were integrated by 
digesting with NotI, transforming this diges- 
tion, and selecting transformants on synthetic 
complete drop-out (SCD) - Leu plates. Plasmids
using backbone pCfB2337 were integrated by 
digesting with NotI, transforming this diges- 
tion, plating transformations on nonselective 
yeast extract peptone dextrose (YEPD) media 
overnight, and then replica plating on hygrom- 
ycin plates. Histidine prototrophy was restored 
by digesting pCfB2226 with NotI, transforming 
into the appropriate strain, and selecting trans- 
formants on SCD - His plates. Plasmid pNTI647, 
expressing dCas9 and tetR, was integrated as 
described in (16). Integration of gRNA expres- 
sion cassettes was validated by amplifying 
across the upstream junction with RM527 and 
RM373 and across the downstream junction 
with RM374 and RM528. Integration of synthetic
transcription factor variants was genotyped
by amplifying across the upstream junction
with RM372 and RM373 and across the down- 
stream junction with RM374 and RM375. 


Media 


LB media was prepared by dissolving LB me- 
dium capsules (MP Biomedicals #3002-31) in 
ultrapure water and sterilized by autoclaving, 
according to manufacturer instructions. LB media 
was supplemented with 50 µg/ml carbenicillin
(Sigma #C1389) for antibiotic selection. YEPD
was prepared by dissolving yeast extract (RPI
#Y20026) and peptone (RPI #20241) in ultrapure
water, sterilizing by autoclaving, then supple- 
menting with 2% final concentration of steri- 
lized dextrose (Fisher Chemical #D16-500). SCD 
media was prepared by dissolving yeast nitro- 
gen base (BD Difco #291940), dextrose to 2% 
final concentration, and the appropriate drop- 
out mix in ultrapure water, then sterilizing 
by filtration. SCD - Leu drop-out media was
prepared using drop-out mix synthetic minus
leucine (US Biological #D9525), and SCD - Ura drop-out
media was prepared using drop-out mix
synthetic minus uracil (US Biological #D9535).


Quantitative reverse transcription 
PCR (qRT-PCR) 


RNA was isolated from yeast by acid phenol 
extraction (66). Reverse transcription was carried
out using ProtoScript II reverse transcriptase
(NEB #M0368L) with oligo-dT priming according
to manufacturer instructions. RT-PCR was carried
out using DyNAmo qPCR mastermix
(ThermoFisher #F410L) according to manufac- 
turer protocols, using a CFX96 Touch Real- 
Time PCR Detection System. Primers, provided 
in table S4, were designed using Primer Blast 
and validated by using a cDNA dilution standard 
curve. Oligonucleotide sequences for RT-PCR
are listed in table S5.


Fluorescence measurements 


Expression of yECitrine, under the control of 
the ZEM transcription factor, (Fig. 1) was 
monitored using a 96-well plate reader (Tecan 
SPARK Multimode Microplate Reader). Over- 
night cultures were back-diluted in 96-well 
round bottom plates to OD 0.05 in SCD - Ura 
selective media containing beta-estradiol and/ 
or anhydrotetracycline as indicated. Fluores- 
cence (excitation at 516 nm and emission at 
540 nm) and OD600 were measured in triplicate
every 15 min.


Turbidostat continuous culture 


Yeast populations transformed with plasmid 
libraries were inoculated into a custom turbidostat
(67) and maintained in SCD - Ura media
at a density corresponding to an optical density
at 600 nm (OD600) of roughly 0.8. When growth
rate reached a steady state (corresponding to 
a doubling time of roughly 90 min), a 50-ml 
preinduction sample was collected. gRNA expression
was then induced by injecting concentrated
anhydrotetracycline into both the
growth chamber and turbidostat media reservoir
to obtain a final concentration of 250 ng/ml
of anhydrotetracycline. Six doublings later
(9 hours) a 50-ml postinduction sample was
collected. For 3AT treatment, samples were 
induced by injecting concentrated 3AT into 
both the growth chamber and turbidostat media 
reservoir to obtain a final concentration of 
90 mM 3AT. Two hours later, a postinduction 
sample was collected. For CiBER-seq experi- 
ments involving the ZEM transcription factor, 
media was supplemented with 8 nM beta- 
estradiol throughout the course of the experiment. 
Collected samples were pelleted by centrifu- 
gation at 4000 x g for 5 min, the media was 
aspirated, and the pellets were stored at −80°C.
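
The sampling interval quoted above follows directly from the steady-state doubling time. A minimal Python sketch of that arithmetic is shown below; the function names are hypothetical and only the 90-min doubling time and six doublings come from the text.

```python
def hours_for_doublings(n_doublings: float, doubling_time_min: float) -> float:
    """Elapsed time, in hours, for a given number of population doublings."""
    return n_doublings * doubling_time_min / 60.0


def fold_expansion(n_doublings: float) -> float:
    """Fold increase in cell number after a given number of doublings."""
    return 2.0 ** n_doublings


print(hours_for_doublings(6, 90))  # 9.0
print(fold_expansion(6))           # 64.0
```

In a turbidostat the culture is diluted to hold density constant, so the 64-fold figure describes the cumulative growth that is washed out over the induction period, not a change in density.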


Barcode sequencing 


All PCR reactions were performed using Q5 
polymerase (NEB M0491S) according to man- 
ufacturer protocols. PCR cycle numbers were 
adjusted as needed (between 6 and 12 cycles) 
to obtain adequate concentration of product 
for sequencing while avoiding overamplifica- 
tion. DNA was purified using DNA Clean & 
Concentrator (Zymo #D4013) according to the 
manufacturer instructions. When necessary, 
AMPure XP beads (Beckman Coulter #463881) 
were used to purify full-length DNA product. 
Size distributions and concentrations of the 
sequencing libraries were measured before 
sequencing using an Agilent TapeStation 2200
and High Sensitivity D1000 ScreenTape. To
ensure proper clustering on the Illumina
HiSeq4000, CiBER-seq libraries were either
pooled with RNA sequencing libraries and
standard 2% PhiX, or individually with 10%
PhiX. PCR products were pooled and sequenced
on HiSeq4000 SR50 (Vincent J. Coates sequencing
facility). Adapter sequences for each library
are listed in table S6, and sequencing data 
accession numbers are available in table S7. 


DNA library preparation 


Plasmid DNA was extracted from yeast pellets 
using Zymo yeast miniprep kit (Zymo #D2004) 
according to manufacturer protocol, except as 
described below. Reagent volumes were scaled 
to accommodate roughly 250 million cells, as 
yeast pellets included 25 OD600 ml of yeast.
Zymolyase concentration was increased to 1 µl
for every 10 million cells and the digestion 
time was doubled to ensure complete cell wall 
digestion. 
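
The scaling described above can be written out explicitly. The sketch below assumes the conversion implied by the text (25 OD600 ml corresponding to roughly 250 million cells, that is, about 10 million cells per OD600 ml); the constant and function names are hypothetical.

```python
# Conversion implied by the text: 25 OD600 ml ~ 250 million cells
CELLS_PER_OD_ML = 10_000_000
# Zymolyase dose stated in the text: 1 µl per 10 million cells
CELLS_PER_UL_ZYMOLYASE = 10_000_000


def cells_from_od_ml(od_ml: float) -> float:
    """Approximate cell number in a pellet given its OD600 x volume (ml)."""
    return od_ml * CELLS_PER_OD_ML


def zymolyase_ul(n_cells: float) -> float:
    """Zymolyase volume (µl) at 1 µl per 10 million cells."""
    return n_cells / CELLS_PER_UL_ZYMOLYASE


cells = cells_from_od_ml(25)
print(cells)                # 250000000
print(zymolyase_ul(cells))  # 25.0
```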

Extracted barcode expression plasmid lib- 
raries derived from pNTI726 and pNTI728 
backbones (Fig. 1) were prepared by PCR 
amplification of barcode sequences with 
primers that incorporated flanking TruSeq 
adapters. Barcode sequences were first am- 
plified by PCR with RM411 and an i5 primer 
from NEBNext Multiplex Oligos (NEB #E7600S) 
for 12 cycles. PCR products were purified 
and concentrated using a DNA Clean and 
Concentrator-5, and then used as the template 
for an additional six cycles of PCR amplification 
with NEBNext i5 and i7 primers (NEB #E7600S).

To achieve a more linear amplification, all 
other CiBER-seq DNA libraries were prepared
using an initial linear amplification by in vitro
transcription (IVT). Extracted plasmid was linearized
by restriction digestion with MfeI for
P(PGK1) and P(HIS4) plasmid libraries (Figs.
2 to 4) or PvuII for P(Z) libraries (Figs. 5 and
6). The digestion product was purified using 
a DNA Clean and Concentrator-5, and used 
as the template for an IVT reaction using T7 
HiScribe (NEB #E2040S) according to the 
manufacturer protocol for short transcripts, 
with an overnight incubation at 37°C. The 
reaction was then treated with DNase I to 
remove template DNA and RNA product was 
purified using an RNA Clean and Concentrator-5 
(Zymo #R1016). Purified RNA was used as the 
template for reverse transcription using Proto- 
Script II according to manufacturer proto- 
col, using sequence-specific primer RM511 for 
P(PGK1) and P(HIS4) plasmid libraries (Figs. 2
to 4) or RM546 for P(Z) libraries (Figs. 5 and
6). Reverse transcription cDNA product was
treated for 30 min with 0.5 µl RNase A
(ThermoFisher #EN0531) and 0.5 µl RNase
H (NEB #M0297S) to remove RNA, and then
DNA was purified using a DNA Clean and 
Concentrator-5 column. The sequence-specific 
reverse transcription primers RM511 and
RM546 incorporate the i7 priming site, and
the i5 priming site is included in the plasmid. 
The final library was generated using eight 
cycles of PCR amplification using NEBNext 
i5 and i7 dual index primers, and purified 
cDNA product as template. The resulting 
product was purified using a DNA Clean and 
Concentrator-5 and submitted for Illumina 
high-throughput sequencing. 


RNA library preparation 


Total RNA was harvested from yeast pellets as 
described above for quantitative PCR (qPCR) 
analysis. Extracted RNA from libraries using 
the pNTI726 or pNTI728 backbones (Fig. 1)
was used as the template for reverse transcrip- 
tion by ProtoScript II using sequence-specific 
primer RM411, according to the manufacturer 
instructions. Reverse transcription product was 
purified using a DNA Clean and Concentrator-5 
and used as the template for PCR amplifica- 
tion using i5 and i7 dual indexing primers 
(NEB #E7600S). 

In CiBER-seq libraries, the barcode is present 
in the opposite orientation. Reverse transcrip- 
tion was carried out using ProtoScript II 
according to the manufacturer protocol for 
oligo-dT priming. Reverse transcription product
was treated with 0.5 µl of RNase A and
0.5 µl of RNase H for 30 min to remove RNA,
and the DNA product was purified using a 
DNA Clean and Concentrator-5. The resulting 
cDNA product was first amplified by PCR for 
six cycles using primers RM511 and RM512 
for isolated P(HIS4) and P(PGK1) libraries,
primers RM512 and RM546 for P(Z) libraries,
primers RM511 and RM810 for P(PGK1) in
dual barcoded CiBER-seq, and primers RM810
and RM812 for P(query) in dual barcoded 
CiBER-seq. The PCR products were then puri- 
fied and subsequently amplified in eight cycles 
of PCR using NEBNext i5 and i7 dual indexing 
primers. PCR products were purified using a 
DNA Clean and Concentrator-5 and submitted 
for Illumina high-throughput sequencing. 


Barcode to gRNA assignment 


The sequencing library for paired-end barcode- 
to-gRNA assignment sequencing was constructed 
by PCR amplification from the barcoded gRNA 
library using primers RM506 and RM509 for 
six cycles using Q5 polymerase according to the 
manufacturer instructions. The resulting PCR 
product was purified using a DNA Clean and 
Concentrator-5 and then used as a template for 
a second, 15-cycle PCR using NEBNext i5 and i7
dual indexing primers. The PCR product was 
purified using a DNA Clean and Concentrator-5 
and analyzed using an Agilent TapeStation 
2200 and High Sensitivity D1000 ScreenTape. 
The PCR product library was then sequenced 
using 150 + 150 base paired-end sequencing 
on an Illumina MiSeq. Barcode to gRNA assignment
sequencing data are available under
accession SRR10327353.

The PacBio sequencing library for dual bar- 
codes to gRNA assignment was prepared with- 
out PCR amplification to avoid sequencing 
errors associated with the rolling circle polymer- 
ase and to maintain sequence diversity. First, 
5 µg of dual barcoded gRNA plasmid library
was digested with HindIII and PmeI and
another 5 µg was digested with BglII and XhoI.
The ~1200-bp fragment from each digestion,
containing the dual barcode and gRNA, was gel
extracted and prepared for sequencing using
the SMRTbell Express Template Prep Kit
2.0 (100-938-900) according to manufacturer
instructions.


Single gRNA induction 


Yeast strains expressing individual gRNAs were 
grown overnight in YEPD, then diluted to 
OD600 of 0.01 in - Leu media containing either
250 ng/ml anhydrotetracycline or no inducer.
Yeast were harvested 12 hours later (after
roughly 6 doublings), pelleted at 4000 x g for
5 min, and pellets were frozen and stored at
−80°C for subsequent RNA extraction or
protein isolation. 


Immunoblotting 


Total protein was isolated from frozen yeast 
pellets by resuspending in 1.5 ml of 5% 
trichloroacetic acid and incubated at 4°C for 
<10 min. Protein was pelleted by centrifuga- 
tion at 16,000 x g for 2 min and washed in 
0.5 ml of acetone, vortexed briefly, and col- 
lected by centrifugation for 5 min at 16,000 x g. 
Protein pellets were washed once more in 1 ml 
of acetone, vortexed, and collected by centrifu- 
gation for 5 min at 16,000 x g. Pellets were 
dried overnight and then resuspended in 100 µl
of freshly prepared protein breakage buffer
[50 mM Tris·HCl pH 7.5, 1 mM EDTA, and
3 mM dithiothreitol (DTT)] containing 100 µl
of glass beads (BioSpec 11079105). Samples were
vortexed in five cycles of 1-min vortexing fol- 
lowed by 1 min on ice. Lysates were transferred 
to new tubes and collected by centrifugation 
at 16,000 x g, then resuspended in 150 µl
resuspension buffer (100 mM Tris·HCl pH
11, 3% SDS). Samples were boiled for 5 min
then allowed to cool and pelleted at 16,000 x g
for 30 s. Then, 120 µl of lysate was transferred
to a new tube, and a bicinchoninic acid (BCA) 
assay was performed to determine protein 
abundance. Equal amounts of total protein 
were loaded on a 4 to 12% polyacrylamide 
Bis-Tris gel (Thermo Scientific #NW04120BOX)
and separated by electrophoresis in MOPS 
buffer at 200 V in a Bolt gel tank (Thermo 
Scientific #425977) according to manufacturer 
instructions. Protein was then transferred to a 
nitrocellulose membrane (Thermo Scientific 
#88018) according to manufacturer guidelines. 
Membranes were blocked for 1 hour in tris-buffered
saline with Tween 20 (TBST) with
5% milk, washed with TBST three times for 
10 min of shaking, and incubated with primary 
antibodies in TBST plus 0.5% milk overnight. 
Hexokinase loading control was probed with 
rabbit anti-Hxk2 (Rockland #100-4159) and 
phosphorylated eIF2α was detected using
Phospho-eIF2α (Ser52) Polyclonal Antibody
(Invitrogen #44-728G). Membranes were then 
washed and probed with secondary antibody 
anti-rabbit immunoglobulin G (IgG), HRP-linked 
antibody (Cell Signaling Technology #707458). 
Membrane was imaged using Pierce enhanced
chemiluminescence (ECL) Western blotting
substrate (ThermoScientific #32209) and the
chemiluminescence measured on a FluorChem 
R (ProteinSimple). 


Sequencing data analysis 


Analysis software is archived (68). Sequencing 
data were processed using Cutadapt to remove 
constant adapter sequences and demultiplex 
libraries based on embedded nucleotide iden- 
tifiers. The adapter sequences for each exper- 
imental dataset are provided in table S6. The 
underlined sequence represents the library- 
specific nucleotide identifier. 

For dual-guide experiments (Fig. 4), demultiplexing
was not required as each pool derived
from a single P(HIS4) library. For this reason,
adapter trimming was instead performed by
taking the first 25 nucleotides using
fastx_trimmer.
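
Keeping the first 25 nucleotides of each read is a fixed-length truncation. The pure-Python sketch below shows the equivalent operation on a single FASTQ record; it is an illustration only and is not the fastx_trimmer implementation, and the record layout and example read are hypothetical.

```python
from typing import Tuple

FastqRecord = Tuple[str, str, str, str]  # (header, sequence, separator, quality)


def trim_record(record: FastqRecord, length: int = 25) -> FastqRecord:
    """Keep only the first `length` bases and their quality scores."""
    header, seq, sep, qual = record
    return (header, seq[:length], sep, qual[:length])


# Hypothetical 30-nt read; the last 5 bases are discarded.
read = ("@read1", "ACGTACGTACGTACGTACGTACGTAGGGGG", "+", "I" * 30)
trimmed = trim_record(read)
print(trimmed[1])       # ACGTACGTACGTACGTACGTACGTA
print(len(trimmed[3]))  # 25
```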

Trimmed barcodes were then counted using 
the custom “bc-seqs” program, which collapses 
barcode variants separated by single-nucleotide 
mismatches (16). Using the “bc-tabulate” script, 
these barcode counts were then collected into 
a matrix, tabulating samples within an experiment.
The P(HIS4) and P(PGK1) libraries
were sequenced in two Illumina HiSeq4000 
runs and the barcode counts for the runs 
were summed. 
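
The idea of collapsing barcode variants that differ by a single nucleotide, and of summing counts across sequencing runs, can be sketched as follows. This is an assumed greedy strategy (each barcode is merged into the most abundant barcode within one mismatch) written for illustration; it is not the actual "bc-seqs" algorithm, and all barcodes and counts are invented.

```python
from collections import Counter


def within_one_mismatch(a: str, b: str) -> bool:
    """True if two equal-length barcodes differ at no more than one position."""
    return len(a) == len(b) and sum(x != y for x, y in zip(a, b)) <= 1


def collapse(counts: dict) -> dict:
    """Greedily merge each barcode into a more abundant neighbor, if any.

    Barcodes are visited from most to least abundant; this simple version
    is quadratic in the number of barcodes and only suits small examples.
    """
    collapsed = {}
    for bc, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        parent = next((p for p in collapsed if within_one_mismatch(p, bc)), None)
        if parent is None:
            collapsed[bc] = n
        else:
            collapsed[parent] += n
    return collapsed


# Hypothetical counts from two sequencing runs of the same library
run1 = Counter({"ACGTACGT": 100, "ACGTACGA": 3, "TTTTCCCC": 40})
run2 = Counter({"ACGTACGT": 80, "TTTTCCCC": 35, "TTTTCCCG": 2})
total = run1 + run2  # counts for the two runs are summed
print(collapse(total))  # {'ACGTACGT': 183, 'TTTTCCCC': 77}
```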

Barcodes were first filtered to remove those 
that lacked at least 32 counts for at least one 
replicate in the preinduction DNA sample, or 
the preinduction P(PGK1)-normalizing sample
for dual-barcoded experiments. The remaining
barcodes were evaluated by differential
activity analysis using mpralm (18). Barcode
expression was compared between samples 
before and after gRNA induction or before and 
after 3AT treatment. Analysis was performed 
using the aggregate = “sum” parameter to sum
barcodes that corresponded to the same gRNA
and model_type = “indep_groups,” which treats
the replicates as independent experiments.
Output tables were merged with gene information
from the Saccharomyces Genome
Database. Scatterplots comparing log fold
change values of individual gRNAs
were first thresholded for guides in which q <
0.05 in at least one of the two expression
analyses. Barcodes corresponding to no-guide 
plasmids were used as negative controls to
determine the distribution of P values. Genes 
involved in amino acid biosynthesis, amino- 
acyl tRNA charging, ISR-controlled translation 
initiation, RNA polymerase III transcription, 
tRNA processing, or actin cytoskeletal arrange- 
ment were hand-curated and used to annotate 
volcano and scatter plots. 
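
The count filter and per-gRNA aggregation that precede the differential activity analysis can be illustrated in a few lines. The sketch below is not mpralm and does no statistical modeling; it assumes one reading of the filter (a barcode is kept if it has at least 32 counts in at least one preinduction replicate), and the sample names, barcodes, and counts are hypothetical.

```python
from collections import defaultdict


def filter_and_aggregate(counts, barcode_to_guide, pre_samples, min_count=32):
    """Drop low-count barcodes, then sum the remaining barcodes per gRNA.

    counts: {barcode: {sample_name: count}}
    barcode_to_guide: {barcode: gRNA name}
    pre_samples: names of the preinduction replicate samples used for filtering
    """
    per_guide = defaultdict(lambda: defaultdict(int))
    for barcode, sample_counts in counts.items():
        # Keep the barcode only if some preinduction replicate reaches min_count
        if not any(sample_counts.get(s, 0) >= min_count for s in pre_samples):
            continue
        guide = barcode_to_guide.get(barcode)
        if guide is None:
            continue  # barcode not assigned to any gRNA
        for sample, n in sample_counts.items():
            per_guide[guide][sample] += n
    return {g: dict(s) for g, s in per_guide.items()}


counts = {
    "bcA": {"pre_1": 40, "pre_2": 10, "post_1": 80},
    "bcB": {"pre_1": 5, "pre_2": 8, "post_1": 100},   # fails the filter
    "bcC": {"pre_1": 33, "pre_2": 35, "post_1": 20},
}
barcode_to_guide = {"bcA": "gRNA_1", "bcB": "gRNA_1", "bcC": "gRNA_1"}
result = filter_and_aggregate(counts, barcode_to_guide, ["pre_1", "pre_2"])
print(result)  # {'gRNA_1': {'pre_1': 73, 'pre_2': 45, 'post_1': 100}}
```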


Mouse embryonic stem cells self-organize into
trunk-like structures with neural tube and somites 


Jesse V. Veenvliet*†, Adriano Bolondi†, Helene Kretzmer†, Leah Haut†, Manuela Scholze-Wittler,
Dennis Schifferl, Frederic Koch, Léo Guignard, Abhishek Sampath Kumar, Milena Pustet, 
Simon Heimann, René Buschow, Lars Wittler, Bernd Timmermann, 


Alexander Meissner*, Bernhard G. Herrmann* 


INTRODUCTION: Vertebrate development com- 
prises multiple complex morphogenetic pro- 
cesses that shape the embryonic body plan 
through self-organization of pluripotent stem 
cells and their descendants. Because mamma- 
lian embryogenesis proceeds in utero, it is dif- 
ficult to study the dynamics of these processes, 
including much-needed analysis at the cellular 
and molecular level. Various three-dimensional 
stem cell systems (“embryoids”) have been de- 
veloped to circumvent this impediment. The 
most advanced models of post-implantation 
development achieved so far are gastruloids, 
mouse embryonic stem cell (mESC)-derived 
aggregates with organized gene expression
domains but lacking proper morphogenesis. 


RATIONALE: To advance the current models, 
we explored the usage of Matrigel, an extra- 
cellular matrix (ECM) surrogate. During em- 
bryonic development, the ECM provides 
essential chemical and mechanical cues. In 
vitro, lower percentages of Matrigel can drive 
complex tissue morphogenesis in organoids, 
which led us to use Matrigel embedding in 
various media conditions to achieve higher- 
order embryo-like architecture in mESC-derived 
aggregates. 


[Figure: summary graphic. Panel labels: Trunk-like structure (TLS); Somite apicobasal polarity; FN1 interface; Chemical modulation (“Bunches of grapes”); Comparative single-cell RNA-seq (TLS, embryo; NMPs, somitic, neural); Genetic manipulation (WT, Tbx6 KO).]

Engineering the embryonic trunk in a dish. Embedding of mouse embryonic stem cell (mESC) aggregates
in an extracellular matrix (ECM) enables generation of trunk-like structures (TLSs) with an in vivo-like
architecture including gut, and neuromesodermal progenitor (NMP)-derived neural tube and somites.
Comparative single-cell RNA sequencing revealed that TLS cell states and differentiation dynamics match
those of the embryo. Chemical modulation and genetic manipulation highlight the utility of the TLS as a
scalable, tractable, and accessible model for investigating mid-gestational embryogenesis. FN1, fibronectin.

RESULTS: We found that embedding of 96-hour
gastruloids in 5% Matrigel is sufficient to in- 
duce the formation of highly organized “trunk- 
like structures” (TLSs), comprising the neural 
tube and bilateral somites with embryo-like 
polarity. This high level of self-organization was 
accompanied by accumulation of the matrix 
protein fibronectin at the Matrigel-TLS inter- 
face and the transcriptional up-regulation of 
fibronectin-binding integrins and other cell 
adhesion molecules. Chemical modulation of 
signaling pathways active in the developing 
mouse embryo [WNT and bone morpho- 
genetic protein (BMP)] resulted in an excess 
of somites arranged like a “bunch of grapes.” 
Comparative time-resolved single-cell RNA 
sequencing of TLSs and embryos revealed that 
TLSs follow the same stepwise gene regulatory 
programs as the mouse embryo, comprising 
expression of critical developmental regula- 
tors at the right place and time. In particular, 
trunk precursors known as neuromesodermal
progenitors displayed the highest differentia- 
tion potential and continuously contributed to 
neural and mesodermal tissue during TLS formation.
In addition, live imaging demonstrated
that the segmentation clock, required for rhyth- 
mic deposition of somites in vivo, ticks at an 
embryo-like pace in TLSs. Finally, a proof-of- 
principle experiment showed that Tbx6-knockout 
TLSs generate ectopic neural tubes at the expense 
of somite formation, mirroring the embryonic
phenotype.


CONCLUSION: We showed that embedding of 
embryonic stem cell-derived aggregates in 
an ECM surrogate generates more advanced 
in vitro models that are formed in a process 
highly analogous to embryonic development. 
Trunk-like structures represent a powerful tool 
that is easily amenable to genetic, mechanical, 
chemical, or other modulations. As such, we 
expect them to facilitate in-depth analysis of 
molecular mechanisms and signaling networks 
that orchestrate embryonic development as well 
as studies of the ontogeny of mutant phenotypes 
in the culture dish. The scalable, tractable, and 
highly accessible nature of the TLS makes it 
a complementary in vitro platform for decipher- 
ing the dynamics of the molecular, cellular, and 
morphogenetic processes that shape the post- 
implantation embryo, at an unprecedented 
spatiotemporal resolution. 

Mouse embryonic stem cells self-organize into
trunk-like structures with neural tube and somites

Post-implantation embryogenesis is a highly dynamic process comprising multiple lineage decisions and
morphogenetic changes that are inaccessible to deep analysis in vivo. We found that pluripotent mouse 
embryonic stem cells (mESCs) form aggregates that upon embedding in an extracellular matrix 
compound induce the formation of highly organized “trunk-like structures” (TLSs) comprising the 
neural tube and somites. Comparative single-cell RNA sequencing analysis confirmed that this process 
is highly analogous to mouse development and follows the same stepwise gene-regulatory program. 
Tbx6 knockout TLSs developed additional neural tubes mirroring the embryonic mutant phenotype, 
and chemical modulation could induce excess somite formation. TLSs thus reveal an advanced
level of self-organization and provide a powerful platform for investigating post-implantation
embryogenesis in a dish.


Vertebrate post-implantation development
comprises a multitude of complex morphogenetic
processes resulting from self-organization
of stem cells and their
descendants shaping the embryonic
body plan (1). Recently developed stem cell
models represent powerful platforms for
deconstructing the dynamics of these processes
in vitro (1, 2). The most advanced models in
terms of developmental stage are gastruloids,
self-organizing mESC aggregates that form
elongating structures that comprise postoccipital
embryo derivatives of all three germ layers
but lack proper morphogenesis (1-3). In vivo,
the extracellular matrix (ECM) provides chem- 
ical signals and exerts mechanical constraints 
via the basement membrane, which has a critical 
role in tissue morphogenesis (4, 5). In vitro, 
Matrigel can serve as an ECM surrogate, and 
embedding of gastruloids in 10% Matrigel 
allowed the formation of a string of somite-like
structures with anterior-posterior polarity
(6). Lower percentages of Matrigel facilitated 
complex morphogenesis in organoids (7). 


Matrigel embedding drives 
trunk-like morphogenesis 


To achieve more advanced embryo-like mor- 
phogenetic features in gastruloids, we used 
Matrigel with various media conditions (Fig. 1A). 
To facilitate high-throughput characteriza- 
tion and quantification, we generated mESCs 
with T::H2B-mCherry (hereafter T^mCH) and
Sox2::H2B-Venus (hereafter Sox2^VE) reporters,
marking the mesodermal (ME) or neural (NE)
lineage, respectively (fig. S1A). Embedding of
96-hour aggregates in 5% Matrigel resulted in 
segmentation in the T^mCH+ ME domain and
formation of a Sox2^VE+ neural tube-like
structure (Fig. 1, A to C, fig. S1, B and C, and
movies S1 and S2). The vast majority of structures
[hereafter referred to as trunk-like structures
(TLSs)] elongated and formed a T^mCH+
pole at the posterior end, with segmentation 
occurring in about half of the TLSs (Fig. 1D). 
Whole-mount in situ hybridization for Tcf15 and 
Uncx confirmed somite identity and revealed 
embryo-like anterior-posterior polarity (Fig. 1E) 
(8). In 61% of the segmented TLSs, bilateral 
somites were observed (fig. S1D). Additional 
WNT activation using CHIR99021 (hereafter 
TLS^C) or combined with bone morphogenetic 
protein (BMP) inhibition by LDN193189 (hereafter 
TLS^CL) improved the physical separation 
of neighboring segments without affecting 
T^mCH+ pole formation or elongation (Fig. 1, A 
to D, and fig. S1, E to H) and resulted in an excess 
of segments at the anterior end, arranged 
like a “bunch of grapes” (Fig. 1, B, C, and F) (9). 
Moreover, the ME domain expanded at the 
expense of the NE compartment, with ap- 
parent disorganization of the posterior end 
and neural tissue (Fig. 1B and fig. S1, G and 
H). This phenotype has not been observed in 
vivo and may be explained by shifting the 
lineage choice of neuromesodermal progeni- 
tors (NMPs)—bipotent cells giving rise to both 
postoccipital NE and ME (10)—toward ME 
as a result of dominant WNT signaling. In addition, 
T^mCH+/Sox2^VE+ putative NMPs were reduced, 
as confirmed by flow cytometry (fig. S1I). 

Phalloidin and N-cadherin staining dem- 
onstrated that the cells of the neural tube and 
somites of TLS, TLS^C, and TLS^CL show proper 
apicobasal polarity, a characteristic of epithelial 
tissues, with F-actin and N-cadherin accumu- 
lating at the apical side (Fig. 1, G and H, and 
fig. S2, A and B). In gastruloids, by contrast, 
F-actin distribution appeared random and or- 
ganized epithelial structures were not detected 
(fig. S2C). 

Whole-mount immunofluorescence analysis 
of FOXA2 and SOX17, transcription factors char- 
acteristic for endoderm, revealed gut formation 
in a subset of TLSs (fig. S3, A to D). Cells at the 
posterior base were SOX17-negative but coexpressed 
FOXA2 with high levels of T^mCH (fig. 
S3B). Our data show that embedding in Matrigel 
is both necessary and sufficient to drive com- 
plex, embryo-like tissue morphogenesis of the 
three embryonic germ layers. 

We next performed a detailed morphomet- 
ric analysis of TLSs and their substructures. 
The data demonstrate reproducibility of the 
three protocols with respect to size and shape 
of the whole structure, somites, and neural 
tube, whereas the gut-like structure shows more 
variation (fig. S3E and fig. S4, A to D). Time- 
resolved whole-structure morphometry showed 
similar morphogenetic changes over time (fig. 
S4B). TLSs in general were larger than gastru- 
loids (fig. S4A). Comparisons of the different TLS 
protocols revealed that, relative to TLS, TLS^C and 
TLS^CL were slightly larger and formed more 
somites, whereas their neural domains were 
reduced in length and narrowed toward the 
anterior end (fig. S4, A to D). In all protocols, 
somites were similar in shape but smaller than 
their embryonic counterparts (fig. S4D). 

To assess whether the segmentation clock, 
an oscillator involved in somitogenesis, is active 
in TLSs, we performed live imaging (11). In 
line with recent observations, we found that 
segmentation occurs in a rhythmic fashion at 
an embryo-like pace in all three TLS condi- 
tions (fig. S5, A to C, and movies S3 to S5) (6). 
TLS^C and TLS^CL showed consecutive formation 
of multiple somites (movies S6 to S9). 


Transcriptional characterization highlights 
selective responses to chemical modulation 


To characterize the structures in detail at the 
molecular level, we performed RNA sequencing 
Fig. 1. Generation of trunk-like structures (TLSs) with somites and a 
neural tube. (A) Schematic overview: 200 to 250 mESCs were aggregated in 
ultralow-attachment plates; Wnt agonist CHIR99021 (CHIR) was added 
between 48 and 72 hours (3). At 96 hours, aggregates were (i) cultured as 
gastruloids (3), (ii) embedded in 5% Matrigel (TLS), and (iii) treated with WNT 
signaling activator CHIR99021 (CHIR) alone (TLS^C), or (iv) treated with CHIR 
and the BMP signaling inhibitor LDN193189 (TLS^CL). The two compounds have 
been reported to induce a (pre-)somitic mesoderm fate in 2D and 3D 
differentiation protocols (5). (B) 3D volumetric renderings (top) and confocal 
sections (bottom) of a gastruloid, TLS, TLS^C, and TLS^CL. Scale bars, 100 μm. 
Each image is representative of at least 10 biological replicates with similar 
morphology and expression patterns. (C) Segments in TLS are T^mCH+ and are 
positioned adjacent to the neural tube. In TLS^C and TLS^CL the segments are 
arranged in “bunches of grapes.” Scale bars, 25 μm. Red arrowhead, neural tube; 
white arrowheads, somites. (D) Quantification of morphogenetic features 
in gastruloids, TLSs, TLS^C, and TLS^CL (see supplementary materials for scoring 
criteria). (E) Segments express somitic markers Uncx and Tcf15 as shown by 
whole-mount in situ hybridization. Note the characteristic stripe-like expression 
pattern of Uncx in TLS due to posterior restriction, whereas Tcf15 is expressed 
throughout the segments (as in the embryo). Scale bars, 100 μm. (F) In TLS^C and 
TLS^CL, Uncx is detected throughout the segments, indicating loss of anterior-posterior 
polarity. Scale bars, 100 μm. (G and H) Confocal sections showing that 
cells of somites and neural tube display apical-basal polarity with NCAD and 
F-actin (phalloidin) accumulating at the apical surface. Inset represents an optical 
section of the neural tube shown in the main panel. Scale bars, 50 μm (G), 
10 μm (H). White arrowheads, somites; red arrowheads, neural tubes. 


(RNA-seq) analysis (fig. S6A) and found that 
the TLS models the postoccipital embryo, 
similar to gastruloids (Fig. 2A) (3). Relative 
to TLS, both TLS^C and TLS^CL showed significant 
up-regulation of genes involved in 
(pre)somitic development [e.g., Tbx6, Msgn1, 
Hes7 (8, 10, 12)] at the expense of NE marker 
genes [e.g., Sox1, Pax6, Irx3 (13)], corroborating 
the flow cytometry and imaging results 
(Fig. 2A and fig. S6, B and C). The analysis of 
marker gene sets for NMPs, for their direct 
descendants undergoing lineage choice (NMP 
ME and NMP NE), and for committed NE and 
ME cells substantiated this finding. Moreover, 
TLS^C and TLS^CL displayed reduced expression 
of markers in all clusters, including ME, relative 
to TLS (Fig. 2B). In contrast, on average 
(pre)somitic mesoderm [(P)SM]-specific markers 
were up-regulated, whereas intermediate ME 
(IM) and lateral plate ME (LPM) markers were 
down-regulated in TLS^C and further reduced 
Fig. 2. Transcriptional characterization of TLSs. (A) Heatmap of log2(TPM + 
0.01) expression (TPM, transcripts per million) of selected genes associated with 
development of indicated embryonic structures in 96-hour aggregates and 
120-hour gastruloids, 120-hour TLSs, 120-hour TLS^C, and 120-hour TLS^CL, as 
measured by RNA-seq. Replicates were derived from pools of independent 
biological samples (see fig. S6A for experimental setup). CE, caudal end; NMP, 
neuromesodermal progenitors; PSM, presomitic mesoderm; LPM, lateral plate 
mesoderm; IM, intermediate mesoderm. (B) Box plots showing distribution 
of marker genes for indicated cell types (average z-score). Boxes indicate 
interquartile range (IQR); whiskers extend to 1.5 × IQR from the hinge. Dots 
indicate outliers. Notches are centered on the median. See data S1 for lists of 
genes used for each category and statistical analysis. (C) Box plot representing 
average z-score per column (pool of three replicates) for somite epithelialization 
factors (see fig. S6E for individual genes). (D) PCA of samples from (A) 
with color coding of individual samples (dots) as in (A). PC1 and PC2 represent 
the two components with highest percentage of explained variance. (E) Selected 
significant terms of gene set enrichment analysis (GSEA) enriched in TLSs as 
compared to gastruloids at 120 hours. See data S2 for full list of significant terms 
[false discovery rate (FDR) < 0.05]. (F) Heatmap of scaled expression (row z-score) 
and log2 FC of integrins with significantly different expression (P_adjusted < 0.09) in 
120-hour TLSs versus gastruloids. Box plot represents z-score per column (sample), 
with boxes indicating interquartile range, whiskers extending to 1.5 × IQR from the 
hinge, dots showing outliers, and central line representing median. Each column 
represents one of three biological replicates. See data S1 for statistical analysis. (G) 3D 
maximum-intensity projection (top three images) and confocal section showing FN1 
accumulation around TLS somites and neural tube (zoomed-in image, white 
arrowheads). Phalloidin staining shows apical-basal polarity. Scale bars, 100 μm, 
50 μm for magnification. Bottom: Light-sheet optical transversal section showing 
FN1 accumulation around the somites (white asterisk) and neural tube (red asterisk) in 
TLSs. Scale bar, 50 μm. 
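
The transformations named in the Fig. 2 legend can be sketched in a few lines. The block below is a minimal illustration, not the authors' pipeline: the function names and the toy TPM values are hypothetical, the use of the sample standard deviation for the z-score is an assumption, and the whisker function returns only the 1.5 × IQR fences beyond which points would be drawn as outliers.

```python
import math
from statistics import mean, stdev, quantiles


def log2_tpm(tpm, pseudocount=0.01):
    """Return log2(TPM + pseudocount); the pseudocount keeps zero counts finite."""
    return math.log2(tpm + pseudocount)


def row_z_scores(values):
    """Scale one gene (one heatmap row) across samples to mean 0 and SD 1.

    Assumption: sample standard deviation (n - 1 denominator).
    """
    mu = mean(values)
    sd = stdev(values)
    if sd == 0:
        # A gene with identical values in all samples carries no contrast.
        return [0.0 for _ in values]
    return [(v - mu) / sd for v in values]


def whisker_limits(values):
    """Return (lower, upper) fences located 1.5 x IQR beyond the hinges."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


# Hypothetical TPM values for one gene across five samples (not study data).
tpm = [0.0, 3.99, 15.99, 63.99, 255.99]
logged = [log2_tpm(t) for t in tpm]
z = row_z_scores(logged)
```

Values outside the returned fences correspond to the dots described as outliers in the legend.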


in TLS^CL (Fig. 2B and fig. S6D) (3, 12), in line 
with the known role of WNT versus BMP 
signaling in PSM versus IM and LPM specifi- 
cation (Fig. 2A and fig. S6D) (8). 

We next searched for gene expression dif- 
ferences possibly underlying improved physical 
separation of somites observed in TLS^C and 
TLS^CL. Among the most strongly up-regulated 
genes relative to TLSs was Wnt6, which acts 
as a somite epithelialization factor in vivo (fig. 
S6E) (14). In addition, multiple ephrins and 
their receptors, and other factors involved in 
somite epithelialization, were up-regulated (Fig. 
2C and fig. S6E) (15, 16). We observed expression 
changes of selected somite polarity markers and 
their inducers, in line with the role played by 
WNTs, SHH, BMPs, and their antagonists in 
somite compartmentalization in vivo (fig. S6, 
F and G) (8, 17). Our data show that exposure 
to CHIR or CHIR/LDN improved segment 
boundary formation but affected somite cellular 
composition. 


Tissue morphogenesis and remodeling genes 
are up-regulated in TLSs 


Principal components analysis (PCA) indicated 
a high transcriptional similarity between gas- 
truloids and TLSs despite profound morpho- 
logical differences (Fig. 2D). The latter are better 
highlighted by gene set enrichment analysis 
(GSEA), which showed that Matrigel embedding 
promotes tissue morphogenesis and remodeling 
(Fig. 2E and fig. S6H). Zooming in on embryonic 
and tissue morphogenesis gene sets revealed 
markers of blood vessel development among 
up-regulated genes, which suggests the induc- 
tion of capillary morphogenesis in TLSs (fig. 
S6I). GSEA also showed an enrichment of cell 
adhesion terms and overall a significant up- 
regulation of corresponding marker genes in 
TLSs (Fig. 2E and fig. S7, A and B). The most 
pronounced increase was observed for integ- 
rins, transmembrane receptors mediating cell 
adhesion to the ECM (important for, e.g., neural 
tube formation, blood vessel development, and 
segmentation) (Fig. 2F and fig. S7, A and B) 
(18-21). Because binding of integrin to the 
glycoprotein fibronectin (FN1) and matrix 
Fig. 3. Time-resolved single-cell RNA-seq of TLSs. In total, 20,294 cells 
were sampled from TLSs at 96, 108, and 120 hours (see fig. S9A for 
experimental setup). (A) UMAP (uniform manifold approximation and 
projection) colored by the 14 clusters identified. (B) Alluvial plot of 
percentage of neuromesodermal progenitors (NMPs), posterior presomitic 
mesoderm (pPSM), anterior PSM, somitic, and neural tube cells over time. 
(C) NMPs coexpress Sox2 and T (top left; blending with normalized expression 
threshold = 0.25) and are characterized by the highest differentiation 
potential (top right; see supplementary materials for differentiation potential 
calculation). NMPs coexpress T and SOX2 at 96 and 120 hours and reside 
at the posterior end of the TLS (confocal sections, bottom left and 
magnifications, 3D maximum-intensity projection, whole structure); white 
arrowheads, NMPs. Scale bars, 50 μm for 96-hour TLS, 100 μm for 3D 
maximum-intensity projection, 20 μm for magnifications. 
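
As a rough illustration of what a coexpression call with a normalized expression threshold of 0.25 might look like, the sketch below flags cells in which two genes both exceed the threshold. It is an assumption-laden example rather than the procedure used for Fig. 3C: min-max scaling to the 0 to 1 range, the function names, and the toy values are all hypothetical.

```python
def min_max(values):
    """Rescale expression values to the range 0..1 (assumed normalization)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # No variation across cells: treat the gene as not expressed above baseline.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def coexpressing(expr_a, expr_b, threshold=0.25):
    """Return one flag per cell: True if both genes reach the threshold."""
    norm_a, norm_b = min_max(expr_a), min_max(expr_b)
    return [a >= threshold and b >= threshold for a, b in zip(norm_a, norm_b)]


# Hypothetical per-cell expression of two genes in four cells (not study data).
sox2 = [0.0, 1.0, 2.0, 4.0]
t = [4.0, 2.0, 0.0, 4.0]
flags = coexpressing(sox2, t)
```

Here the second and fourth cells are flagged, because only they reach the threshold for both genes.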
Fig. 4. TLS differentiation trajectories. (A) UMAP colored by identified 
clusters with trajectories inferred from RNA velocity. Gray arrow flows 
represent calculated velocity trajectories. (B) Heatmap with scaled 
expression of genes involved in somitogenesis measured in 5966 cells from 
120-hour TLSs rooted in NMPs and ordered by pseudotime. (C) UMAP colored 
by expression of indicated genes (left) and whole-mount in situ hybridization for 
the same genes in TLS and E9.5 embryos (right). Numbers indicate the fraction of 
TLSs with embryo-like expression. Scale bars, TLS, 100 μm; embryo, 200 μm. 
(D) Heatmap with scaled expression of genes involved in neural development 
measured in 3462 cells from 120-hour TLSs rooted in NMPs and ordered by 
pseudotime (top) and UMAP colored by expression of indicated genes (bottom). 
(E) Split violin plots showing expression of marker genes for primordial germ cell-like 
cells (PGCLCs, left) and confocal sections of TLS showing PGCLC 
specification dynamics: T/PRDM14^VE double positive at 76 hours, SOX2^VE-high/ 
DPPA3+ PGCLCs at 108 hours, and DPPA3+ cells in close contact with the T^mCH+ 
gut-like structure at 120 hours. At 120 hours, Sox2^VE-high cells in contact with 
FOXA2+ gut-like domain or DPPA3+ cells in contact with T^mCH+ gut-like domain 
were observed in seven of nine TLSs (see also fig. S14D). Scale bars, 20 μm for 
76-hour TLS, 50 μm for 108-hour TLS (20 μm for magnifications), 25 μm for 
120-hour TLS. Red arrowheads, gut-like structure; white arrowheads, DPPA3+ PGCLCs. 
Fig. 5. TLS cell states are embryo-like and Tbx6-/- TLSs recapitulate the 
embryonic knockout phenotype. (A) Schematic of our comparative transcriptome 
analysis of TLSs with postoccipital E7.5 and E8.5 embryos at the 
single-cell level. (B) TLS UMAP colored by assigned embryonic cell states. TLS 
clusters are projected as corresponding colored contours. Blue font, TLS 
clusters; red font, embryo clusters. (C) Split heatmap with percentage of 
assigned cells (dark gray) and certainty score (orange) for TLS cells from 
the indicated cluster upon unbiased mapping to the in vivo counterpart. Font 
colors as in (B). (D) Alluvial plot of percentage of cells assigned to the indicated 

assembly play an important role in somite and 
neural tube formation (21, 22), we asked whether 
FN1, which is highly expressed in TLSs, was 
retained by Matrigel, as recently shown in 2D 
culture (fig. S7C) (23). FN1 protein clearly ac- 
cumulated at the TLS-Matrigel interface; this was 
not observed in gastruloids, where FN1 likely 
diffuses into the medium (Fig. 2G, fig. S7D, and 
movie S10). Taken together, our data suggest 
that the activation of morphogenetic programs 
by Matrigel embedding is driven by ECM as- 
sembly involving integrins and fibronectin. 


Single-cell RNA-seq demonstrates embryo-like 
dynamics of cell differentiation 


We next focused on the TLS condition because 
it produced the most in vivo-like structures. 
After confirming reproducibility at the molec- 
ular level (fig. S8, A to C), we performed a 
time-resolved single-cell RNA-seq (scRNA-seq) 
analysis on a total of 20,294 postprocessed cells 
sampled from TLSs at 96, 108, and 120 hours 
(fig. S9A). Clustering analysis identified 14 different 
cell states. The larger clusters corresponded 
to derivatives of the PSM and NE that 
flank putative NMPs, whereas smaller clusters 
comprised endoderm, endothelial cells, and 
primordial germ cell-like cells (PGCLCs) (fig. 
S9B). The main clusters organized into a continuum 
of states recapitulating spatiotemporal 
features of the developing postoccipital embryo 
(Fig. 3A). Across the three time points sampled, 
progenitor cells gradually decreased in favor 
of more mature neural and somitic cells as 
development progressed (Fig. 3B and fig. S9C). 
Putative NMPs coexpressing T and SOX2, or 
T^mCH, Sox2^VE, and CDX2, were located at the 
posterior end at 96 and 120 hours (Fig. 3C 
and fig. S10, A to C) (10, 24). TLS-NMPs thus 
display an in vivo-like NMP signature and 
have the highest differentiation potential, as 
they give rise to differentiating cells of both 
neural and mesodermal lineages (Fig. 3C and 
fig. S10, D to F). 
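
The shift from progenitors toward more mature cell states across time points amounts to a per-time-point composition table, the kind of summary an alluvial plot displays. A minimal sketch follows, assuming each cell carries a time point and a cluster label; the function name and the toy labels are hypothetical and do not reproduce data from the study.

```python
from collections import Counter, defaultdict


def composition_by_timepoint(cells):
    """Compute the percentage of each cell state within each time point.

    cells: iterable of (timepoint, cell_state) pairs, one pair per cell.
    Returns {timepoint: {cell_state: percent of cells at that time point}}.
    """
    counts = defaultdict(Counter)
    for timepoint, state in cells:
        counts[timepoint][state] += 1
    result = {}
    for timepoint, counter in counts.items():
        total = sum(counter.values())
        result[timepoint] = {
            state: 100.0 * n / total for state, n in counter.items()
        }
    return result


# Hypothetical toy labels (not study data): progenitors shrink, somitic cells grow.
cells = (
    [("96h", "NMP")] * 6 + [("96h", "somite")] * 4
    + [("120h", "NMP")] * 2 + [("120h", "somite")] * 8
)
comp = composition_by_timepoint(cells)
```

Percentages are computed within each time point, so samples with different cell numbers remain comparable.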
in vivo clusters in 96-, 108-, and 120-hour TLSs. (E) Simplified schematics of 
Tbx6-/- in vivo phenotype and knockout/reporter constructs. (F) Quantification 
of segmentation phenotype in TLS-Tbx6-/-. Data represent three different 
experiments performed with two independent mESC lines of each genotype. 
(G) Formation of ectopic neural tubes in TLS-Tbx6-/-. Ectopic neural tubes are 
identified as SOX2+/Tbx6^VE+ tubular structures flanking the main SOX2+-only 
neural tube. Green, SOX2; magenta, Tbx6^VE. White arrowheads indicate 
Tbx6^VE+ somites in the wild type (WT) and Tbx6^VE+/SOX2+ ectopic neural tubes 
in Tbx6-/-. Scale bars, 50 μm. 


RNA velocity analysis revealed neural and 
somitic trajectories rooted in the NMPs, further 
suggesting that the TLS recapitulates the de- 
velopmental dynamics observed in the mid- 
gestational embryo (Fig. 4A and fig. S11A) (25). 
In vivo, NMPs and their descendants are ar- 
ranged in order of progressive maturity along 
the posterior-to-anterior axis (8). Accordingly, 
ordering of TLS-derived cells along a pseudo- 
temporal trajectory showed that the somitic 
trajectory reflects the genetic cascade observed 
in the embryo (Fig. 4B and fig. S12A). For 
example, the trajectory from Fgf8+ NMPs and 
PSM, through the determination front marked 
by Mesp2, to Meox1+ somites, was faithfully 
recapitulated, and the embryo-like spatial ar- 
rangement was confirmed by whole-mount in 
situ hybridization (Fig. 4C) (8). Likewise, the 
genetic cascade from NMPs to neural progen- 
itors reflected the in vivo differentiation path 
in space and time (Fig. 4D). Subclustering of the 
neural cells demonstrated that TLSs generate 
both dorsal and ventral neural subtypes, with 
dorsal subtypes being more prevalent (fig. S12B) 
(13). The analysis of Hox gene expression at con- 
secutive time points showed in vivo-like collin- 
ear activation, as described for gastruloids (fig. 
S12C) (3). To test whether TLS somites establish 
cell states segregated along the dorsal-ventral 
(D-V) and anterior-posterior (A-P) somite axes in 
vivo, we reclustered all somitic cells. At 96 hours, 
we detected two main groups corresponding 
to the Uncx+ posterior and Tbx18+ anterior 
somite domains, in line with the A-P polarity 
established during segmentation (fig. S13, A 
and B) (6, 8). At 120 hours, we found distinct 
clusters of Pax3+ (putative dorsal dermomyotomal) 
and Pax1+ (putative ventral sclerotomal) 
cells, as well as a small cluster of Lbx1+/Met+ 
putative migratory limb muscle precursors 
(fig. S13, C to F) (8, 26). In addition, Scx+ 
syndetome cells were detected (fig. S13G), 
and Uncx and Tbx18 expression were anti- 
correlated (fig. S13H). 
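
Anticorrelation between two markers can be quantified with a Pearson coefficient computed across cells. The sketch below is illustrative only: the text does not state which statistic was used, the function name is hypothetical, and the toy values are chosen to give a perfectly negative relationship.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-cell expression (not study data): one marker rises as the other falls.
uncx = [0.0, 1.0, 2.0, 3.0]
tbx18 = [3.0, 2.0, 1.0, 0.0]
r = pearson(uncx, tbx18)
```

A coefficient near -1 indicates mutually exclusive expression, as expected for markers of opposite somite halves.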

Primordial germ cell (PGC) specification in 
the embryo occurs between embryonic day (E) 
6.0 and E6.5 via T-mediated activation of 
Blimp1 and Prdm14, and at E7.5, nascent 
PGCs can be identified as a group of DPPA3+ 
cells in the posterior primitive streak, which 
later migrate along the hindgut to the gonads 
(27, 28). We assigned PGCLC identity using 
marker genes characteristic for PGCs and 
identified their location in the TLS (Fig. 4E 
and fig. S14). At 76 hours, roughly corresponding 
to stage E6.5, we detected T/Prdm14^VE-coexpressing 
cell clusters (Fig. 4E and fig. S14, 
A and B). At 108 hours, we found a group of 
Sox2^VE-high cells that coexpressed DPPA3 (Fig. 
4E and fig. S14C). At 120 hours, Sox2^VE-high 
cells were detected in contact with FOXA2+ 
cells, and DPPA3+ cells in contact with a T^mCH+ 
gut-like epithelial structure (Fig. 4E and fig. 
S14D). These data show that TLSs contain cells 
displaying characteristics typical for PGCs. 


TLSs display a high complexity of cell states 
that match their in vivo counterparts 


Single-cell comparison of 120-hour gastruloids 
with 120-hour TLSs identified different propor- 
tions of the major cell states (fig. S15, A to C). 
A more refined analysis revealed a higher 
complexity of cell states in TLSs (fig. S16, A 
to E, fig. S17, A to F, and fig. S18, A to D), and 
expression of later (more posterior) Hox genes 
suggests development into more advanced trunk 
stages (fig. S18, E and F). The comparison of 
TLSs with TLS^CL showed that in the latter, (i) 
sclerotomal and more mature neural cells are 
virtually absent, and (ii) somitic as well as 
endothelial cell identities are altered (fig. 
S15, B and C, fig. S16, A and B, fig. S17, A to G, 
and fig. S18, A and B). Application of RNA 
velocity confirmed that in TLS^CL, NMPs are 
highly biased toward the mesodermal lineage, 
whereas contribution to the neural lineage is 
diminished relative to TLS-NMPs; this is further 
corroborated by up-regulation of posterior 
PSM and down-regulation of neural marker 
genes (fig. S15, D and E) (8, 13). 

To investigate how closely the cellular states 
identified in TLSs resemble those in embryos, 
we mapped TLS single-cell transcriptomes to a 
scRNA-seq compendium of postoccipital embry- 
onic cellular subtypes (Fig. 5A) (29). The data 
revealed globally high accordance of TLS and 
embryonic cell states including characteristic 
marker genes, and pairwise comparison of 
mapped clusters identified only a small frac- 
tion of differentially expressed genes (Fig. 5, B 
and C, and fig. S19, A to D). Of note, PS- and 
early NMP-like cells were exclusively present 
at 96 hours and were replaced by late NMP- 
like cells at 108 and 120 hours (Fig. 5D). Taken 
together, our scRNA-seq analyses demonstrate 
that the TLS executes gene regulatory programs 
in a spatiotemporal order resembling that of 
the embryo. 
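
One simple way to think about mapping query cells onto a labeled reference is a nearest-neighbor vote, with the share of agreeing neighbors serving as a certainty score. The sketch below illustrates that idea only; it is not the mapping procedure used in this study, and the function name, the value of k, the Euclidean metric, and the toy expression vectors are all assumptions.

```python
import math
from collections import Counter


def map_cell(query, reference, k=3):
    """Assign a query cell to the majority label among its k nearest reference cells.

    query: expression vector (sequence of floats).
    reference: list of (expression_vector, label) pairs.
    Returns (label, certainty), where certainty is the fraction of the
    neighbors used that carry the winning label.
    """
    if not reference:
        raise ValueError("reference must contain at least one cell")
    ranked = sorted(reference, key=lambda item: math.dist(query, item[0]))
    neighbors = ranked[:k]
    votes = Counter(label for _, label in neighbors)
    label, n = votes.most_common(1)[0]
    return label, n / len(neighbors)


# Hypothetical two-gene reference (not study data).
reference = [
    ([0.0, 0.0], "NMP"),
    ([0.0, 1.0], "NMP"),
    ([1.0, 0.0], "neural"),
    ([5.0, 5.0], "somite"),
    ([5.0, 6.0], "somite"),
]
label, certainty = map_cell([0.1, 0.2], reference)
```

In this toy case two of the three nearest reference cells are NMPs, so the query is assigned to NMP with a certainty of 2/3.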


Knockout TLSs display the embryonic 
mutant phenotype 


Finally, to explore the utility of TLSs further, 
we conducted a proof-of-concept experiment 
to test whether gene ablation would reproduce 
the embryonic mutant phenotype. In vivo, loss 
of Tbx6 results in transdifferentiation of 
prospective PSM and subsequent formation 
of ectopic neural tubes at the expense of PSM 
and somites (Fig. 5E) (30, 31). We deleted Tbx6 
from Tbx6::H2B-Venus (Tbx6^VE) mESCs and 
generated TLSs, which clearly failed to form 
somites even upon treatment with CHIR or 
CHIR/LDN (Fig. 5, E and F, and fig. S20, A and 
B). Quantitative polymerase chain reaction 
analysis on fluorescence-activated cell sorter-purified 
Tbx6^VE+ cells revealed up-regulation 
of neural markers at the expense of (P)SM 
markers in Tbx6-/- cells, thus recapitulating 
the in vivo phenotype at the molecular level 
(fig. S20C). Finally, whole-mount immunofluorescence 
analysis for SOX2 showed that 
Tbx6-/- TLSs generated ectopic Tbx6^VE+ neural 
tubes, whereas gastruloids, TLS^C, and TLS^CL 
formed an excess of morphologically indistinct 
SOX2+ tissue (Fig. 5G and fig. S20, D to F). 


Discussion 


The TLS model provides a scalable, tractable, 
readily accessible platform for investigating 
the lineage decisions and morphogenetic pro- 
cesses that shape the mid-gestational embryo 
with high spatiotemporal resolution. Our re- 
sults show that the TLS faithfully reproduces 
key features of postoccipital embryogenesis, in- 
cluding axial elongation with coordinated neu- 
ral tube, gut, and somite formation as well as 
PGCLCs. Accordingly, genetic manipulation 
of the TLS faithfully reproduced the morpho- 
genetic and molecular changes observed in vivo. 
Thus, the TLSs will enable deeper analysis of 
the ontogeny of mutant phenotypes and provide 
an additional tool for investigating morphogenetic 
mechanisms unavailable in vivo. We 
also envision that the TLS^C and TLS^CL models 
may become important for testing current 
concepts of somitogenesis—for instance, the 
hypothesis that somite size and shape are con- 
trolled by local cell-cell interactions (9). 

Mechanistically, our data highlight a crucial 
role for the ECM surrogate in unlocking the 
potential of in vitro derived mESC aggregates, 
although future efforts will have to address 
the exact functional contribution of individual 
components and biophysical properties (fig. S21), 
possibly using modular synthetic 3D matrices 
(7, 32). Alternatively, the single-cell expression 
catalog of TLSs and gastruloids can provide 
initial guidance for further exploration of cell- 
cell and cell-matrix interactions and their con- 
trol of embryonic architecture (fig. S7, A and B, 
and figs. S22 and S23). 

Developmental clock and mechanism of de novo 
polarization of the mouse embryo 

INTRODUCTION: During preimplantation development, 
the establishment of apicobasal 
cell polarity is key for the transition from 
totipotency to pluripotency, which induces 
cell differentiation toward trophectoderm 
(TE). In the mouse embryo, this event is pro- 
grammed to occur at the eight-cell stage, and 
this timing follows an intrinsic developmen- 
tal clock that is independent of embryo size 
or cell cycle progression. Despite the import- 
ance of apical domain formation, the molec- 
ular mechanisms to establish cell polarization 
and the temporal regulation of this event in 
mouse and human embryos have remained 
largely elusive. 

RATIONALE: In different mammalian species, 
zygotic genome activation (ZGA) is evolutionar- 
ily conserved to occur before the establishment 
of cell polarization. We therefore hypothesized 
that zygotic transcription regulates the timing 
of polarization. To test this, we deployed assays 
to alter the cellular concentration of zygotic 
transcripts and to assess the consequences of 
these changes for the timing of embryo polar- 
ization. We also performed an RNA interference 
(RNAi) screen on 124 zygotically expressed 
transcripts to determine the molecular identity 
of zygotic transcripts crucial to cell polariza- 
tion. Finally, we combined cutting-edge imaging 
methods with biophysical modeling to account 














Molecular mechanism and temporal regulation for the apical domain formation. Zygotic genome 
activation enables the expression of Tfap2c and Tead4, which regulates the cooperative membrane 
recruitment of apical proteins. Cooperative recruitment interacts with Rho GTPase-regulated apical 
protein lateral mobility to establish the apical domain. 
for how the factors we identified regulate the 
de novo establishment of cell polarization. 


RESULTS: Cell polarity in the mouse embryo 
is marked by the appearance of a cap-shaped 
apical domain. Consistent with our hypothe- 
sis, an increase or decrease of zygotic transcripts 
respectively accelerated or inhibited apical do- 
main formation. Our RNAi screen identified 
two transcription factors—transcription factor 
AP-2 gamma (Tfap2c) and TEA domain tran- 
scription factor 4 (Tead4), which play a redun- 
dant role in regulating cell polarization timing. 
Both Tfap2c and Tead4 proteins accumulate 
after ZGA, and elevation of their expression 
allows polarity proteins to anchor to the apical 
surface prematurely at the four-cell stage. How- 
ever, these apical proteins failed to organize 
into an expanded apical domain, instead becom- 
ing hypercentralized to form membrane protru- 
sions. This indicates that an additional condition 
is required for the apical domain formation. We 
have previously shown that Rho guanosine 
triphosphatase (GTPase) signaling, which 
regulates the actomyosin apical localization and 
becomes activated around the eight-cell stage, 
is important for cell polarity. In this study, we 
found that premature activation of Rho GTPase 
together with expression of Tfap2c and Tead4 allows a 
complete, precocious induction of the apical do- 
main, leading to the premature expression of TE 
transcription factors and to morphogenesis 
events downstream of cell polarization. By com- 
bining quantitative imaging measurements and 
mathematical modeling, we show that apical 
domain formation is driven by the dynamic in- 
terplay between two key processes: (i) the coop- 
erative recruitment of ezrin via the actin network 
and (ii) the lateral mobility of ezrin on the mem- 
brane. On the basis of the experimental evi- 
dence and biophysical simulations of these 
interactions, we show that Tfap2c and Tead4 
control the cooperative recruitment of ezrin, 
whereas RhoA promotes membrane mobility. 


CONCLUSION: The timing and mechanisms for 
cell polarization have remained largely unknown. 
We now identify molecules that are necessary 
and sufficient for the de novo establishment of 
cell polarization in the mouse embryo. Our re- 
sults indicate a direct role of ZGA in regulating 
the timing of cell polarization. Beyond identify- 
ing the key molecules sufficient to establish 
cell polarization, we also provide biophysical 
understanding of the mechanism by which 
these molecules act to build cell polarization 
in the mammalian embryo. 

Developmental clock and mechanism of de novo 
polarization of the mouse embryo 


Meng Zhu, Jake Cornwall-Scoones, Peizhe Wang, Charlotte E. Handford, Jie Na, 
Matt Thomson, Magdalena Zernicka-Goetz* 


Embryo polarization is critical for mouse development; however, neither the regulatory clock nor the 
molecular trigger that it activates is known. Here, we show that the embryo polarization clock reflects the 
onset of zygotic genome activation, and we identify three factors required to trigger polarization. Advancing 
the timing of transcription factor AP-2 gamma (Tfap2c) and TEA domain transcription factor 4 (Tead4) 
expression in the presence of activated Ras homolog family member A (RhoA) induces precocious 
polarization as well as subsequent cell fate specification and morphogenesis. Tfap2c and Tead4 induce 
expression of actin regulators that control the recruitment of apical proteins on the membrane, whereas 
RhoA regulates their lateral mobility, allowing the emergence of the apical domain. Thus, Tfap2c, Tead4, and 
RhoA are regulators for the onset of polarization and cell fate segregation in the mouse. 


The totipotent mammalian zygote can 
produce any embryonic or extraem- 
bryonic tissue, but this ability becomes 
restricted in the first cell fate decision 
that generates distinct inner cell mass 
(ICM) and outer extraembryonic trophecto- 
derm (TE). The ICM will form epiblast (EPI) 
and extraembryonic primitive endoderm (PE), 
generating the future fetus and the yolk sac, 
respectively. The TE will form the placenta. 
Formation of these three lineages by implan- 
tation is a prerequisite for successful pregnancy. 

Embryo polarization is key to the segregation 
of the ICM and TE lineages (1, 2). In the 
mouse, this process happens at the eight-cell 
stage (2-4), when each blastomere acquires an 
apical domain, comprising the Par protein 
complex and ERM proteins (ezrin, radixin, 
and moesin) enclosed by an actomyosin ring 
(5, 6). The apical domain enables expression 
of transcription factors such as Cdx2 and 
Gata3, which drive differentiation into TE, 
whereas apolar cells maintain pluripotency 
to become ICM (7, 8). 

Mammalian embryo development is regu- 
lative, yet the timing of embryo polarization 
remains unchanged even if embryos are split 
into individual blastomeres, when cells are 
aggregated together, or when cell divisions 
are prevented. Thus, the polarization seems 
set to a strict developmental clock that is 
independent of cell number (9, 10). Here, we 
show that this clock reflects activation of the 
zygotic genome, and we identify three factors 
whose convergent activity triggers 
self-organization of the apical domain. 


1Department of Physiology, Development and Neuroscience, 
University of Cambridge, Downing Street, Cambridge CB2 
3DY, UK. 2Division of Biology and Biological Engineering, 
California Institute of Technology (Caltech), Pasadena, CA 
91125, USA. 3Centre for Stem Cell Biology and Regenerative 
Medicine, School of Medicine, Tsinghua University, Beijing 
100084, China. 4Centre for Trophoblast Research, University 
of Cambridge, Cambridge CB2 3EG, UK. 

*Corresponding author. Email: mz205@cam.ac.uk 


Zhu et al., Science 370, eabd2703 (2020) 


Critical threshold of transcripts required for 
embryo polarization 


Polarization timing varies between species, 
reflecting the onset of zygotic genome activa- 
tion (ZGA) (11, 12). In mouse, the major wave 
of ZGA occurs at the two-cell stage, but an 
additional transcriptional wave also occurs at 
the early eight-cell stage, just before polar- 
ization (13). To determine whether the latter 
transcriptional wave is associated with po- 
larization, we first treated embryos from the 
early eight-cell stage with two transcription 
inhibitors: 5,6-dichlorobenzimidazole 1-β-D-
ribofuranoside (DRB) and triptolide. Each 
drug prevented apical localization of the po- 
larity marker Pard6 (Fig. 1, A to D, and fig. S1, 
A to G). Polarization was restored by wash- 
ing out the reversible transcription inhibi- 
tor DRB (fig. S1, H to J). Thus, transcription 
at the early eight-cell stage appears to be re- 
quired for embryo polarization. As a second 
test, we reduced the cytoplasmic volume (Fig. 
1E), shown by others to increase the concen- 
tration of newly synthesized mRNA (14, 15). 
We injected zygotes with an apical marker 
[ezrin-red fluorescent protein (RFP) mRNA] 
and resected 30 to 40% of the cytoplasm from 
a two- or four-cell-stage blastomere using a 
technique that does not compromise devel- 
opment (16) (Fig. 1F). Single-molecule fluo- 
rescence in situ hybridization confirmed that 
this reduction increased the concentration 
of newly transcribed mRNAs (17) (fig. S2, A 
to E). Such blastomere resection advanced 
embryo polarization by 2.1 hours when per- 
formed at the two-cell stage (Fig. 1, F to I; N = 
62 pairs; movie S1) and by 3.3 hours at the 
four-cell stage (fig. S2, F to H, and Fig. 1J; N = 
76 pairs; movie S2). Both experimental and 
control embryos established all three lineages 
as blastocysts (fig. S2, I to K). Inhibiting tran- 
scription with a 3-hour pulse of DRB led both 
resected and control blastomeres to polarize 
simultaneously (Fig. 1K and fig. S2, L to N). 
These results indicate that de novo synthe- 
sis of transcripts and their accumulation to 
a critical threshold is required for embryo 
polarization. 
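
The dilution argument above can be made concrete with a minimal sketch. It assumes, purely for illustration, that a blastomere produces transcripts at a constant rate that is unaffected by resection, and that polarization begins once their concentration crosses a fixed threshold. The function names and all numerical values are hypothetical and are not taken from the study.

```python
def concentration_gain(fraction_removed: float) -> float:
    """Fold increase in concentration when a fraction of the cytoplasm is
    removed and the amount of newly made transcript stays the same."""
    if not 0.0 <= fraction_removed < 1.0:
        raise ValueError("fraction_removed must be in [0, 1)")
    return 1.0 / (1.0 - fraction_removed)


def time_to_threshold(rate_per_hour: float, volume: float,
                      threshold_conc: float) -> float:
    """Hours until amount / volume reaches threshold_conc, assuming linear
    accumulation from zero (an illustrative assumption, not a measured law)."""
    if rate_per_hour <= 0.0:
        raise ValueError("rate_per_hour must be positive")
    return threshold_conc * volume / rate_per_hour


if __name__ == "__main__":
    # Arbitrary units: an intact cell has volume 1.0 and reaches threshold at 10 h.
    intact = time_to_threshold(rate_per_hour=1.0, volume=1.0, threshold_conc=10.0)
    for removed in (0.30, 0.40):
        resected = time_to_threshold(1.0, 1.0 - removed, 10.0)
        print(f"removed {removed:.0%}: gain x{concentration_gain(removed):.2f}, "
              f"threshold reached {intact - resected:.1f} h earlier")
```

Under these assumptions the time to threshold scales with the remaining volume, so the sketch only shows the direction of the effect. The measured advances need not follow this linear idealization.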


Tfap2c and Tead4 are required 
for embryo polarization 


We hypothesized that the requirement for 
eight-cell-stage transcription could result from 
direct expression of cytoskeletal regulators 
of cell polarization or their indirect expression 
through a transcriptional hierarchy. We there- 
fore interrogated published single-cell RNA 
sequencing (RNA-seq) data (18) and selected 
genes for 118 cytoskeletal polarity regula- 
tors and six transcription factors that show 
increased expression by the eight-cell stage 
and hence are likely to be active according to 
assay for transposase-accessible chromatin 
sequencing (ATAC-seq) (19) (fig. S3 and tables 
S1 and S2). We down-regulated each of these 
124 genes by RNA interference (RNAi) (figs. 
S3 and S4, A and B) and scored the timing of 
embryo polarization from time-lapse imaging 
of the distribution of ezrin-RFP (Fig. 2, A to 
D). Only depletion of the transcription factors 
Tfap2c (transcription factor AP-2 gamma) or 
Tead4 (TEA domain transcription factor 4) 
prevented embryo polarization (Fig. 2, A to C, 
and E, and fig. S4, A to E). Individual deple- 
tion of Tfap2c and Tead4 delayed polariza- 
tion from the 8- to 16-cell stage (Fig. 2, A to C, 
E, and F), whereas polarization was entirely 
abolished by their co-depletion (Fig. 2, D to 
H). Depletion of Tfap2c and Tead4 also pre- 
vented precocious polarization resulting from 
blastomere resection (fig. S4, F and G). 

To confirm the requirement for Tfap2c and 
Tead4 in polarization, we deleted both genes 
by CRISPR-Cas9 mutagenesis. We designed 
three single-guide RNAs (sgRNAs) to target a 
single protein-coding exon of each gene (Fig. 
2I) and injected them into the zygote together 
with Cas9 mRNA and ezrin-RFP mRNA as an 
apical marker. We categorized blastomeres 
on the basis of whether they had undetectable, 
moderate, or wild-type levels of Tfap2c or Tead4 
proteins at the 8- to 16-cell stage (fig. S5, A and 
B) and confirmed by DNA sequencing that 
blastomeres with undetectable Tfap2c or Tead4 
were homozygous mutants (Fig. 2J and fig. 
S5, C and D). Simultaneous deletion of Tfap2c 
and Tead4 completely abolished embryo po- 
larization, in contrast to their individual dele- 
tions, which were less severe (Fig. 2, J to L, 
and fig. S5, E and F). Thus, zygotic expression 

Fig. 1. The dependency of polarization on nascent transcripts. (A) Scheme 
indicating inhibitor treatments. (B) Dimethyl sulfoxide (DMSO)-treated (control) or 
DRB-treated 8- to 16-cell embryos were analyzed for the localization of F-actin, 
Pard6, and DNA. Arrowheads indicate the apical domain. (C) Polarized cell number 
in DMSO (control) or DRB-treated embryos. ****P < 0.0001, Mann-Whitney U test. 
N = 2 experiments. (D) Apical enrichment of Pard6 (see methods) in 8- to 16-cell- 
stage cells treated with DMSO (control) or DRB. ****P < 0.0001. Mann-Whitney 
U test. (E) Scheme of the hypothesis: Newly synthesized factors important for 
polarization accumulate up to a point at which polarization is induced at the eight- 
cell stage. Decreasing the cell size elevates the concentration of such factors, 
leading to an advance in polarization timing. (F) Scheme showing the blastomere 
resection procedure. (G and H) Time-lapse of control or smaller sister blastomeres 
from the experiment described in (F). Arrowheads indicate the apical domain. Dotted 
yellow squares indicate the magnified regions (top row). (I and J) Polarization time 
difference (see methods) between smaller and control sister blastomeres from (F) or 
fig. S2F; each bar represents one comparison. Smaller cells polarize earlier in the 
significant majority of cases. N = 13 experiments for (I), N = 6 experiments for (J). 
****P < 0.0001, one-sample t test, hypothetical mean = 0. (K) Polarization time 
difference between control and smaller DRB-treated sister cells, from experiments 
in fig. S2L. Each bar represents one comparison. Pulsed transcription inhibition 
prevents the early polarization of smaller cells. N = 3 experiments. One-sample t test, 
hypothetical mean = 0. Arrowheads indicate the apical domain. Scale bars, 15 μm. 

of Tfap2c and Tead4 is required for embryo 
polarization at the eight-cell stage. 

Tead4 had previously been shown to function 
only downstream of polarization, after 
nuclear relocalization of its transcriptional 
coactivator Yap, to induce TE transcription 
factor expression (7). To gain further insight 
into the earlier role of Tead4, we examined 
the localization of Yap. We found Yap 
localization in the nucleus before polarization at 
the eight-cell stage (20) (fig. S6, A to C) that 
was diminished by down-regulation (fig. S6, 
D and E) and enhanced by up-regulation of 
Tead4 (fig. S6, F and G). Thus, Tead4 affects 
the localization of Yap before polarization, 


Fig. 2. Zygotic expression of Tfap2c and Tead4 is essential for polarization. 
(A to D) Time lapse of ezrin-RFP localization in embryos with or without Tfap2c 
and/or Tead4. Tfap2c and Tead4 co-depletion causes polarization failure until the 
16-cell stage. Arrowheads indicate the apical domain. (E and F) Polarized cell 
number in different conditions and stages. Each dot, square, or triangle represents 
an embryo. dsTT, dsTfap2c+dsTead4. ns, not significant; *P = 0.0306; ***P = 0.0006; 
****P < 0.0001. Kruskal-Wallis test for (E), one-way analysis of variance (ANOVA) 
test for (F). (G) F-actin, Pard6, and ezrin localization in late eight-cell-stage 
embryos injected with double-strand RNA targeting GFP (dsGFP; control) or dsTT. 
(H) Quantification of ezrin apical enrichment. Student's t test. 

indicating a previously undescribed, polarity-
independent Tead4 function. 


Advancing expression of Tfap2c, Tead4, 
and Rho GTPase advances polarization timing 


Having found that Tfap2c and Tead4 are 
required for embryo polarization, we sought 
to determine whether advancing their 
expression would advance the timing of 
polarization. We therefore injected Tfap2c and Tead4 
mRNAs into one blastomere at the late two- 
cell stage to elevate their expression at the four- 
cell stage (fig. S7, A to C). Advancing expression 
of Tead4 did not induce premature polar- 
ization (fig. S7, D, E, I, and J). Advancing 
expression of Tfap2c led to formation of cell 
protrusions enriched in apical polarity pro- 
teins, including Pard6 and ezrin at the four-cell 
stage (fig. S7, F to I). Advancing the expres- 
sion of Tfap2c and Tead4 together also in- 
duced premature formation of protrusions 
(fig. S7, G to I; Fig. 3, B and C; and movies S3 
and S4). In all cases, the induced protrusions 
were smaller than the natural apical domain 
at the eight-cell stage (Fig. 3B; movies S3 and 
S4; and fig. S7H). These results suggested that 
Tfap2c and Tead4 might be sufficient to trig- 
ger polarization of apical proteins but that 
other factors are required for proper apical 
domain formation. 

We previously found that actomyosin acti- 
vation by protein kinase C-Rho guanosine tri- 
phosphatase (GTPase) signaling was necessary 
but not sufficient to trigger apical domain 
formation (Fig. 3, B and C, and movie S5) (21). 
We therefore hypothesized that Rho GTPase 
activation might be required in addition to 
Tfap2c and Tead4 to achieve complete polar- 
ization. To test this, we injected mRNAs for 
Tfap2c and Tead4 at the two-cell stage (with 
ezrin-RFP mRNA as apical marker) and mRNA 
for constitutively active Ras homolog family 
member A (RhoA)-Q63L (Gln63→Leu) at the 
four-cell stage [with LifeAct-green fluores- 
cent protein (GFP) as injection marker] (Fig. 
3A and fig. S7K). Expression of all three fac- 
tors established complete apical domains at 
the four-cell stage (Fig. 3, B and C, and movie 
S6). These induced apical domains were en- 
riched with ezrin and Pard6, strongly resem- 
bling apical domains that normally form only 
at the eight-cell stage (Fig. 3B and fig. S8, B 
and C). 

To confirm these results, we overexpressed 
these factors in just half the embryo and found 
that the targeted blastomeres polarized earlier 
than controls in the other embryo half (Fig. 3, 
D to F, and movies S7 and S8). We observed no 
differences in division timing between blasto- 
meres, suggesting that induced four-cell-stage 
polarization is not caused by a cytokinesis de- 
lay (fig. S8A). Our results indicate that a tran- 
scriptional program triggered by Tfap2c and 
Tead4 alongside activation of actomyosin 
downstream of Rho GTPase signaling triggers 
polarization at a specific stage of embryogenesis. 


Advancing Tfap2c, Tead4, and Rho 
GTPase expression advances 
morphogenesis and differentiation 


Embryo polarization at the eight-cell stage is 
followed by the zippering of adjacent apical 
domains, which expand and seal their boun- 
daries at the late 16-cell stage to enable blasto- 
cyst formation (22). To determine whether 
premature polarization could also advance 
the zippering process, we induced expression 
of Tfap2c, Tead4, and RhoA-Q63L either in the 
whole embryo or in half of the embryo to trig- 
ger four-cell-stage polarization and followed 
subsequent development by time-lapse mi- 
croscopy. The induced premature polarization 
resulted in zippered domains associated with 
the tight junction protein ZO-1, not at the 16-cell 
stage but at the eight-cell stage (fig. S8, B to F). 
Thus, embryo polarization is sufficient to ad- 
vance the subsequent step of embryogenesis 
leading to blastocyst formation. 

As polarization in the mouse embryo is fol- 
lowed by cell fate specification, we determined 
whether overexpressing Tfap2c, Tead4, and 
RhoA-Q63L to induce polarization at the four- 
cell stage would advance differentiation of 
cells inheriting an apical domain into TE. Pre- 
mature polarization induced premature ex- 
pression of the TE transcription factors Cdx2 
and Gata3 in a cell-autonomous manner (fig. 
S8, G to K, and movies S9 and S10). Thus, the 
combined activities of Tfap2c, Tead4, and RhoA- 
Q63L are sufficient to advance the timing not 
only of polarization but also the differentia- 
tion program. 


Tfap2c and Tead4 are required for apical 
protein centralization 


To define the relative roles of RhoA, Tfap2c, 
and Tead4 in driving polarization, we visual- 
ized events leading to apical domain forma- 
tion in living embryos expressing LifeAct-GFP 
and ezrin-RFP from the mid to late 8-cell stages. 
During apical protein polarization, ezrin-RFP 
first became concentrated at the center of the 
cell contact-free surface to form an apical patch 
concomitant with a local reduction of actin. We 
refer to this stage as centralization (Fig. 4, A and 
B, and movie S11). This apical patch of ezrin-RFP 
expanded, and actin became concentrated in a 
ring around it. We refer to this phase as ex- 
pansion (Fig. 4, A and C, and movie S11). Down- 
regulation of Tfap2c and Tead4 diminished 
the initial centralization of ezrin-RFP (Figs. 2G 
and 4D), implying that centralization is re- 
quired for apical domain formation. 


Tfap2c and Tead4 control polarized growth 
of apical protein clusters 


Imaging of apical domain centralization in mid 
to late eight-cell-stage embryos with higher 
temporal-spatial resolution (movie S12) 
revealed that ezrin formed clusters that 
colocalized with actin clusters when the embryo had 
just compacted but was not yet polarized 
(Fig. 4, E and J). As polarization progressed, 
the ezrin clusters grew; the more-distant clus- 
ters grew faster than those near cell-cell con- 
tacts, resulting in ezrin enrichment toward the 
middle of the cell-contact free surface (Fig. 4, E 
and F). The amount of membrane-associated 
ezrin increased as the ezrin clusters grew (Fig. 
4G), suggesting that cluster growth is driven 
by ezrin’s recruitment to the membrane. 

Overexpression of Tfap2c and Tead4 led to 
an increase in membrane enrichment of ezrin 
and precocious growth of both ezrin and actin 
clusters at the late-four-cell stage (Fig. 4, H, I, 
and L to N). By contrast, Tfap2c and Tead4 
depletion decreased ezrin’s membrane enrich- 
ment and prevented growth of ezrin and actin 
clusters at the mid-eight-cell stage (Fig. 4, J 
to N). Together, this suggests that Tfap2c and 
Tead4 are required for the growth of actin and 
apical protein clusters, recruitment of apical 
protein to the membrane, and centralization 
of apical protein. 


Tfap2c and Tead4 regulate actin dynamics 
to promote apical protein cluster growth 


As our results suggested that Tfap2c and Tead4 
could regulate actin dynamics to direct the 
growth of apical protein clusters, we next ex- 
amined the cortical movements generated by 
the contractile actomyosin network during 
apical domain formation (23) (movie S13). We 
first tested whether such cortical movements 
can drive asymmetric growth of apical protein 
clusters by tracking LifeAct and ezrin clus- 
ters using particle image velocimetry (PIV) 
(movie S14; and see methods section in the 
supplementary materials). In contrast to actin 
flows of postmitotic cells (22) (fig. S9C), PIV 
did not detect any obvious movement toward 
the center of the cell-contact free surface (fig. 
S9, A and B). Accordingly, we found that in- 
hibiting actin flows with blebbistatin failed to 
prevent asymmetric ezrin cluster growth (fig. 
S9, D to G, and movie S15). Thus, asymmetric 
clustering of ezrin is not driven by the cortical 
flow mediated by actomyosin contractility. 
Time-lapse observations revealed that ezrin 
cluster growth occurred during the merging 
and splitting of actin clusters and was unim- 
peded by blebbistatin (Fig. 5A), suggesting that 
cortical actin remodeling may allow ezrin’s re- 
cruitment to growing clusters. Consistently, per- 
turbing actin depolymerization in eight-cell 
embryos with Jasplakinolide (JASP) prevented 
apical domain formation (fig. S9, H and K). 
Moreover, when we inhibited the Arp2/3 com- 
plex with CK666 to prevent actin nucleation, 
ezrin cluster growth (Fig. 5, B and C) and 
apical domain formation (fig. S9, I and K) 
were also blocked. By contrast, treatment of 

Fig. 4. Tfap2c and Tead4 regulate apical domain centralization by regulating apical 
protein clustering. (A) LifeAct-GFP and ezrin-RFP dynamics during polarization. 
Squares denote magnified regions; yellow and gray arrows indicate apical protein 
or actin ring movements. The apical domain forms after centralization and 
expansion steps. N = 7 embryos, N = 4 experiments. (B and C) Ezrin-RFP signal at 
the cell-contact free surface during centralization or expansion steps in (A). 
Ezrin signal is normalized against average membrane signal intensity. (D) Ezrin-RFP 
distribution on cell-contact free surface at the late eight-cell stage in dsGFP 
(control) or dsTfap2c+dsTead4 injected cells. Numbers indicate examined cells. 
N = 2 experiments. (E) Ezrin-RFP distribution during apical centralization. 
(F) Ezrin cluster size during polarization. More than 1500 clusters were analyzed 
for each time point. Data shown as mean ± SD. N = 2 experiments. (G) Ezrin membrane 
enrichment during centralization. N = 4 cells from four embryos. Data shown 

Fig. 5. Clustering of apical proteins is regulated by local actin dynamics. (A) LifeAct-GFP and ezrin-RFP dynamics with or without blebbistatin treatment during 
polarization. Arrowheads indicate the merging of ezrin clusters during actin polymerization. Scale bar, 1 μm. N = 5 cells for each condition. N = 3 experiments. 
Blebbistatin treatment did not prevent the clustering of actin or ezrin proteins. (B) LifeAct-GFP and ezrin-RFP localization in mid eight-cell stage embryos treated with 
DMSO (control) and CK666. Scale bars, 5 μm. (C) Ezrin cluster size in cells treated with DMSO or CK666 in (B). ****P < 0.0001, Mann-Whitney U test. More 
than 1500 clusters were measured in each condition, N = 2 experiments. (D) F-actin and ezrin-RFP localization in late four-cell-stage embryos expressing ezrin-RFP 
only, or with Tfap2c+Tead4 treated with or without CK666. (E) Ezrin cluster size in (D). ****P < 0.0001, Kruskal-Wallis test. More than 1100 clusters were calculated in 
each condition, N = 2 experiments. (F) Experimental strategy for RNA-seq. (G) Heatmap showing the expression of selective cytoskeleton regulators downstream 
of Tfap2c and Tead4. (H) LifeAct-GFP and ezrin-RFP localization in dsGFP or dsArpc1b injected embryos. (I) Polarized cell numbers in dsGFP and dsArpc1b groups. 
Numbers within bars represent cell number. ***P < 0.001, Fisher’s exact test. (J) Ezrin-RFP apical enrichment in dsGFP or dsArpc1b cells. Each dot represents a 
cell. ***P < 0.001, Student's t test. (K) LifeAct-GFP and ezrin-RFP localization in different conditions. Squares indicate magnified regions (right). (L) Number of cells 
showing apical protrusions in (K). Scale bars, 15 μm. 


embryos with a formin inhibitor (SMIFH2) 
did not affect apical domain formation (fig. 
S9, J and L). These observations indicate that 
actin remodeling is required for ezrin cluster- 
ing and apical domain formation. We also 
found that CK666 abolished apical protein 
polarization induced by Tfap2c and Tead4 
overexpression in four-cell embryos (Fig. 5, D 
and E). Thus, Tfap2c- and Tead4-dependent 
regulation of actin dynamics is required for 
the growth of ezrin clusters and apical pro- 
tein formation. 


Tfap2c and Tead4 control expression 
of actin regulators 


To view events downstream of Tfap2c and 
Tead4, we carried out RNA-seq of eight-cell- 
stage embryos depleted of Tfap2c and/or Tead4. 
For each group of embryos (control GFP RNAi, 
Tfap2c RNAi, Tead4 RNAi, and Tfap2c/Tead4 
co-RNAi), two biological replicates were col- 
lected with 10 embryos per sample, and ex- 
periments were performed on two strains to 
eliminate genetic background effects (Fig. 5F 
and methods). The effect of Tfap2c and Tead4 
depletion was highly reproducible between 
biological replicates and between genetic 
backgrounds (fig. S10A). Depletion of Tfap2c led 
to the down-regulation of 749 or 929 genes 
(with a twofold differential cutoff), depending 
on the strain, whereas Tead4 depletion led to 
down-regulation of 242 or 314 genes (fig. S10, 
B and C). Their co-depletion led to an addi- 
tional 135 or 95 genes being down-regulated 
compared with single knockdown embryos, 
depending on the strain (fig. S10, B and C). 
A significant proportion of down-regulated 
genes in double-knockdown embryos had actin 
polymerization functions (Fig. 5G and table 
S3). These included known actin regulators 
for apical domain formation, such as Cdc42 
effector protein family members (Borg) (24), 
and other actin regulators, including the Arp2/3 
complex component Arpc1b, the tropomyosin 
protein Tpm4, Marcks and Marcksl1 proteins, 
and the FERM family member Epb4.1l5, whose 
functions have not been explored in the mouse 
embryo. Expression of these actin regulators 
becomes up-regulated between the two- and 
eight-cell stages and correlates with the size 
increase of actin clusters during polarization 
(Fig. 5G). Depletion of Tfap2c and Tead4 
eliminated expression of actin regulators and 
accordingly led to a decreased actin cluster size 
(Figs. 4, H and J to M, and 5G). 
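
A twofold cutoff of the kind mentioned above can be sketched as a filter on log2 fold change. The pseudocount, the gene labels, and the expression values below are hypothetical placeholders. The procedure actually used is described in the methods and is not reproduced here.

```python
import math


def log2_fold_change(control: float, knockdown: float,
                     pseudocount: float = 1.0) -> float:
    """log2(knockdown / control), with a pseudocount to avoid division by zero."""
    return math.log2((knockdown + pseudocount) / (control + pseudocount))


def down_regulated(expression: dict, cutoff_fold: float = 2.0,
                   pseudocount: float = 1.0) -> list:
    """Return the sorted gene labels whose expression drops by at least
    cutoff_fold. `expression` maps a label to a (control, knockdown) pair."""
    limit = -math.log2(cutoff_fold)
    return sorted(
        gene for gene, (control, knockdown) in expression.items()
        if log2_fold_change(control, knockdown, pseudocount) <= limit
    )


if __name__ == "__main__":
    # Made-up normalized expression values for three placeholder genes.
    example = {"gene_a": (99.0, 49.0), "gene_b": (99.0, 79.0), "gene_c": (9.0, 0.0)}
    print(down_regulated(example))
```

A real analysis would also account for replicate variability before calling a gene down-regulated; the sketch only shows the thresholding step.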

To test whether these actin regulators par- 
ticipate in apical domain formation, we depleted 
Arpc1b, Tpm4, Marcksl1, or Epb4.1l5 individually 
from two-cell embryos and determined the 
apical domain formation efficiency at the late 
eight-cell stage (Fig. 5, H to J, and fig. S11, A 
to D). Consistent with the effects of CK666, 
depletion of Arpc1b led to defective apical 
domain formation (Fig. 5, H to J) in natural 
eight-cell embryos and prevented Tfap2c- and 
Tead4-induced apical protein polarization at 
the four-cell stage (Fig. 5, K and L). Together, 
these results suggest that Tfap2c and Tead4 
control embryo polarization by activating ex- 
pression of actin regulatory proteins. 


RhoA signaling reorganizes the actin network 
during polarization 


Knowing that not only Tfap2c and Tead4 but 
also RhoA-Q63L were required for apical pro- 
tein clustering and apical domain formation 
in four-cell embryos, we aimed to determine 
how excess RhoA activity (by overexpressing 
RhoA-Q63L) or reduced RhoA activity (by treat- 
ment with RhoA inhibitor C3-transferase) at 
the mid eight-cell stage would affect ezrin’s 
membrane distribution (fig. S12A). Overexpres- 
sion of RhoA-Q63L eliminated actin and ezrin 
cluster formation, resulting in the homoge- 
neous distribution of actin and ezrin on the 
membrane (fig. S12, B and C). By contrast, 
C3-transferase treatment resulted in the ec- 
topic clustering of actin and ezrin on the cell 
membrane (fig. S12, D and E), reminiscent of 
four-cell embryos overexpressing Tfap2c and 
Tead4 but lacking RhoA activity (Fig. 3B and 
fig. S7, G and H). Thus, RhoA signaling is re- 
quired to reorganize the actin network in the 
embryo and thereby the clustering of apical 
proteins induced by Tfap2c and Tead4. 


Positive feedback and lateral mobility govern 
apical domain formation 


To gain biophysical understanding of how 
Tfap2c, Tead4, and RhoA regulate the timing 
and pattern of apical domain formation, we 
measured the growth of ezrin cluster size during 
apical protein centralization and found that 
it increases exponentially (Fig. 4F), suggesting 
involvement of a positive feedback mechanism. 
To gain understanding of this mechanism, we 
tagged ezrin with green to red photoactivatable 
Dendra2 fluorescent protein to track Dendra2- 
ezrin movement after blue light conversion. 
RFP signal dynamically dissipated within sec- 
onds when Dendra2 was photoconverted at 
the mid eight-cell stage either within or out- 
side the nascent apical domain, suggesting 
rapid ezrin membrane turnover (Fig. 6, A to 
D). Irrespective of the site of photoconversion, 
ezrin-Dendra2 relocated to the nascent do- 
main in proportion to the concentration of 
ezrin in this area, indicating positive feed- 
back of ezrin on its own accumulation. This 
correlates with previous measurements of 
cooperative recruitment of ezrin to phospha- 
tidylinositol 4,5-bisphosphate (PIP2) mem- 
branes (25, 26). 

To determine whether ezrin dynamics can 
account for apical protein centralization, we 
constructed a model (see methods) based on 
four empirically grounded assumptions (Fig. 
6E): (i) a cooperative increase in ezrin binding 
rate (k_on) with increasing ezrin concentration, 
saturating above a critical concentration (E_crit), 
suggested by the positive feedback above 
(Fig. 6, B and D) and known cooperative 
binding of ezrin to membranes (25, 26); (ii) a 
limit to ezrin membrane loading by the finite 
pool of PIP2 (P_0) based on colocalization of 
ezrin with PIP2 in the apical domain (fig. S12F) 
and prevention of apical domain formation by 
reduced PIP2 (27); (iii) lateral motility of ezrin 
along the membrane, as observed (Fig. 6, A 
to D, and fig. S12G), which can be modeled as 
effective diffusion with diffusivity (D_E); and 
(iv) dissociation of ezrin at a uniform rate from 
the membrane (k_off) (Fig. 6E). 

We simulated ezrin dynamics in one di- 
mension and estimated parameter values for 
simulations best fitting our experimental 
measurements (see methods and “Supple- 
mentary Modeling” section in the supplemen- 
tary materials). Our simulations reproduced 
the dynamic changes of ezrin distribution during 
centralization in vivo (Fig. 6F). The model also 
recapitulated the ability of single cells to form 
a centralized ezrin domain in the absence of 
Fig. 6. Tfap2c, Tead4, and RhoA regulate apical domain formation through 
cell-cell contacts (fig. S12H and methods) and 
the ability of cell-cell contacts to constrain 
apical ezrin to the center of cell-contact free 
surface (fig. S12I). Thus, experimental 
observations and computational simulation 
together reveal that positive feedback, lateral 
mobility, and competition for limiting PIP2 
are sufficient to explain the apical protein 
centralization step. 
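
The four ingredients of the model (cooperative binding that saturates above a critical concentration, a finite PIP2 pool, lateral diffusion, and uniform dissociation) can be explored with a minimal one-dimensional sketch. This is not the authors' implementation. The quadratic form of the cooperativity, the treatment of PIP2 as a single shared pool, the no-flux boundaries, the seeded central bump, and every parameter value are assumptions chosen only to make the example run.

```python
from dataclasses import dataclass


@dataclass
class Params:
    # All values are illustrative, in arbitrary units.
    n_points: int = 41         # grid points along the contact-free surface
    dx: float = 1.0            # grid spacing
    dt: float = 0.02           # explicit Euler time step
    diffusivity: float = 2.0   # lateral mobility of membrane-bound ezrin
    k_basal: float = 0.002     # binding rate without cooperativity
    k_coop: float = 0.5        # extra binding rate at full cooperativity
    e_crit: float = 1.0        # concentration above which binding saturates
    k_off: float = 0.5         # uniform dissociation rate
    pip2_pool: float = 30.0    # finite pool limiting total bound ezrin


def binding_rate(e: float, p: Params) -> float:
    """Cooperative binding rate; quadratic below e_crit (assumed), flat above."""
    s = min(e / p.e_crit, 1.0)
    return p.k_basal + p.k_coop * s * s


def step(profile: list, p: Params) -> list:
    """Advance the bound-ezrin profile by one explicit Euler step."""
    free = max(p.pip2_pool - sum(profile) * p.dx, 0.0)  # unoccupied pool
    last = len(profile) - 1
    new = []
    for i, e in enumerate(profile):
        left = profile[i - 1] if i > 0 else e        # no-flux boundary
        right = profile[i + 1] if i < last else e    # no-flux boundary
        lap = ((left + right) - 2.0 * e) / (p.dx * p.dx)
        rate = binding_rate(e, p) * free - p.k_off * e + p.diffusivity * lap
        new.append(e + p.dt * rate)
    return new


def simulate(p: Params, steps: int = 5000, background: float = 0.05,
             seed_height: float = 0.25) -> list:
    """Start from a flat profile with a small central bump and integrate."""
    profile = [background] * p.n_points
    profile[p.n_points // 2] += seed_height
    for _ in range(steps):
        profile = step(profile, p)
    return profile


if __name__ == "__main__":
    final = simulate(Params())
    print(" ".join(f"{e:.2f}" for e in final))
```

Whether the seeded bump sharpens into a single domain, splits, or flattens depends on the balance between `k_coop`, `e_crit`, and `diffusivity`; varying them is a simple way to explore the qualitative behavior discussed in the text.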


Tfap2c, Tead4, and RhoA control 
cooperative recruitment and lateral 
mobility of apical proteins 


Our model predicts that elevating the saturation 
threshold (E_crit) for cooperative recruitment 
of ezrin would change the steady-state 
distribution of membrane-bound ezrin, result- 
ing in a narrowed peak resembling the Tfap2c 
and Tead4 overexpression phenotype (Fig. 6G 
and “Supplementary Modeling” section of the 
supplementary materials). This suggests that 
Tfap2c and Tead4 regulate the kinetics of the 
cooperative ezrin recruitment. To test this, we 
computed the rate of change in membrane-bound 
ezrin concentration (ΔE/ΔT) as a function 
of the ezrin concentration during apical 
centralization, for both control and Tfap2c 
and Tead4 overexpressing embryos (fig. S12J). 
This rate of change revealed that when the 
ezrin concentration falls below a threshold 
value, local ezrin subsequently decreases, but 
when the local concentration exceeds this val- 
ue, more ezrin gets recruited to the membrane 
(fig. S12J). 
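
The rate-of-change analysis can be illustrated with a short sketch that pairs each concentration sample with its forward-difference rate and then locates the concentration at which the rate changes sign. The function names and the sample traces are hypothetical, and the sketch is not the procedure used to produce fig. S12J.

```python
def rate_vs_concentration(series: list, dt: float) -> list:
    """Pair each concentration with the forward-difference rate (ΔE/ΔT)."""
    return [(series[i], (series[i + 1] - series[i]) / dt)
            for i in range(len(series) - 1)]


def estimate_threshold(pairs: list):
    """Return the concentration at which the rate crosses from negative to
    non-negative, linearly interpolated between the two flanking samples.
    Returns None if no such crossing exists."""
    ordered = sorted(pairs)  # sort by concentration
    for (c0, r0), (c1, r1) in zip(ordered, ordered[1:]):
        if r0 < 0.0 <= r1:
            return c0 + (0.0 - r0) * (c1 - c0) / (r1 - r0)
    return None


if __name__ == "__main__":
    # Two made-up traces: one starts below the threshold and decays,
    # the other starts above it and keeps accumulating.
    decaying = rate_vs_concentration([0.8, 0.6, 0.4], dt=1.0)
    growing = rate_vs_concentration([1.2, 1.4, 1.6], dt=1.0)
    print(estimate_threshold(decaying + growing))
```

With noisy measurements, binning the pairs by concentration before looking for the sign change would likely be more robust than the pairwise scan shown here.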

Myosin motor activity within the actin cortex 
can affect the lateral mobility of membrane 
proteins that can negatively regulate the for- 
mation of membrane clusters (27), suggesting 
that RhoA might regulate the lateral mobility 
of ezrin. To test this, we quantified the “spread” 
of photoconverted signal (see methods) and 
compared the difference between control and 
RhoA-Q63L-overexpressing cells (fig. S12K). 
Elevating RhoA activity increased the spread- 
ing effect of photoconverted ezrin on the mem- 
brane, suggesting that RhoA positively regulates 
the lateral mobility of ezrin (fig. S12K). In line 
with this, our simulations predict that increas- 
ing lateral mobility leads to homogeneously 
distributed ezrin (Fig. 6G) and that insuffi- 
cient lateral mobility results in multiple peaks 
of apical protein (fig. S12L). These predictions 
are concordant with the phenotypes resulting 
from overexpression of RhoA-Q63L and deple- 
tion of RhoA in the eight-cell embryo (fig. S12, 
B to E). 

Our model suggests that lateral mobility and 
the cooperative recruitment threshold have 
opposing effects on the shape of the apical 
patch (Fig. 6H and fig. S12M). Specifically, a 
normal apical domain will form only when 
both processes are activated at an appropri- 
ate level. This prediction provides a qualitative 
explanation for the concurrent requirements 
for Tfap2c, Tead4, and RhoA signaling in 
regulating apical domain formation (Figs. 3B 
and 6I). 


Discussion 


The importance of the first appearance of 
apical-basal cell polarity in mammalian de- 
velopment is evident from its requirement 
for triggering the first cell fate diversification 
event. Here, we show that zygotic expression 
of Tfap2c and Tead4 is a prerequisite for such 
polarization. We have been able to induce pre- 
cocious embryo polarization and thereby ad- 
vance subsequent embryogenesis by driving 
the ectopic expression of Tfap2c, Tead4, and 
activated Rho GTPase. Our findings help ac- 
count for the temporal relationship between 
zygotic genome activation and the establish- 
ment of embryo polarization across multiple 
mammalian species (11, 12). 

Embryo polarization at the eight-cell stage 
has been viewed as a model for epithelial po- 
larization. However, formation of the apical 
domain is distinct from that of many other 
cell types, as it can occur in the absence of ex- 
ternal cues, such as the extracellular matrix 
or cell adhesion. The mechanisms behind such 
distinctive spontaneous symmetry-breaking 
properties have remained elusive. Here, we 
show that the initial step for symmetry break- 
ing is the centralization of apical proteins 
through their two types of behavior on the 
membrane: actin-mediated cooperative re- 
cruitment and lateral mobility. These two 
processes act as opposing forces to regulate 
the shape of the apical domain; a cooperative 
recruitment mechanism enables symmetry 
breaking and concentration of the apical pro- 
teins, whereas lateral mobility allows apical 
proteins to diffuse, thereby establishing a 
crescent-shaped patch (Fig. 6, G and H, and 
fig. S12M). The balanced activity between
the two processes ensures the proper shape 
of the apical domain because an excessive 
cooperative recruitment force would lead to 
small and often multiple domains, whereas 
excessive lateral mobility would lead to the 
uniform distribution of apical proteins and 
thereby inhibit symmetry breaking (Fig. 6G 
and fig. S12L). 

Our results suggest that cooperative recruit- 
ment is regulated by actin remodeling con- 
trolled by Tfap2c and Tead4. Although the 
detailed mechanism is beyond the scope of 
this work, it is possible that ezrin is preferen- 
tially recruited to the actin structure promoted 
by the Arp2/3 complex in a process similar to 
protein condensation (28). In such a case, the 
density of the branched actin network would 
positively affect the saturation level of ezrin in 
the cooperative recruitment process. It has 
been observed in vitro that the actin clusters 
formed by Arp2/3 activity are degraded by 
high levels of cortical myosin (29), which could 
explain the opposing effects between the two
transcription factors and RhoA in regulating 
the apical protein clustering and accordingly 
the apical domain shape. 

The regulatory regime we describe is based 
on the behavior of ezrin, and it is likely to 
apply to other apical proteins, such as the Par 
complex, whose polarization dynamics are 
highly similar and also require the actin net- 
work and membrane binding, the key condi- 
tions of the process we describe (27). Our work 
illustrates how the embryo establishes cell 
polarization at a specific developmental stage 
under the regulation of stage-dependent path- 
ways. Our results also provide a biophysical 
explanation for how polarization is established, 
indicating that positive feedback combined
with lateral mobility is sufficient to drive this
self-organization process. These results there-
fore provide insight into both the timing and
mechanism of the establishment of de novo 
polarization in the mouse embryo, the critical 
event for the first cell fate specification. 


Methods summary 


This work followed regulations of the Animals 
(Scientific Procedures) Act 1986 Amendment 
Regulations 2012 reviewed by the University 
of Cambridge Animal Welfare and Ethical Re- 
view. Embryos were collected from superovu-
lated F1 females (C57Bl6xCBA) crossed with F1
males. For embryo culture and inhibitor treat- 
ment, embryos were recovered at the zygote or 
two-cell stage in M2 medium and transferred 
to KSOM medium for long-term culture. The 
inhibitors—or the same amount of vehicles 
(for control conditions)—were applied to the 
culture. The microinjection procedure, im- 
munostaining, static imaging and image pro- 
cessing, and real-time quantitative polymerase 
chain reaction were carried out as previously 
described (27). For the photoconversion ex- 
periment, the region of interest (ROI) covered 
a rectangular area of ~5 µm length and ~2 µm
width on the membrane of cells expressing 
ezrin-Dendra2, using the midplane of the 
blastomeres as a reference. The ROI was il- 
luminated at 405 nm for 5 s, after which con- 
verted proteins were imaged with a 568-nm 
laser at an emission wavelength between 580 
and 620 nm every 2 s per frame for 5 min. The 
converted scanning speed is 200 Hz, and 
the normal scanning speed is 700 Hz. For all 
imaging settings, the images have been re- 
corded using the 1024 pixel by 1024 pixel for- 
mat. PIV analysis was performed using the 
PIVlab MATLAB algorithm (https://pivlab.blogspot.com/).
For statistics, the sample distribution as well as
statistical tests were performed using Prism software
(www.graphpad.com/). Details of the materials and methods,
as well as details for RNA-seq and modeling 
methods, can be found in the supplementary 
materials. 

INTRODUCTION: Emerging infectious diseases
pose one of the greatest threats to human 
health and biodiversity. Phylodynamics is an 
effective tool for inferring epidemiological param- 
eters to guide intervention strategies, particularly 
for human viruses such as severe acute respira- 
tory syndrome coronavirus 2 (SARS-CoV-2). How- 
ever, phylodynamic analysis has historically been 
limited to the study of rapidly evolving viruses 
and, in rare cases, bacteria. Nonetheless, applica-
tion of phylodynamics to nonviral pathogens has
immense potential, such as for predicting disease 
spread and informing the management of wild- 
life diseases. 

We conducted a phylodynamics analysis of 
devil facial tumor disease (DFTD), a transmis- 
sible cancer that has spread across nearly the 
entire geographic range of Tasmanian devils 
and threatens the species with extinction. 
DFTD is transmitted as an allograft through
biting during common social interactions, sus-
ceptibility is nearly universal, and case fatality
rates approach 100%. The goals of our study
were to (i) characterize the geographic spread
of DFTD, (ii) identify whether there are differ-
ent circulating tumor lineages, and (iii) quan-
tify rates of transmission among lineages.

Tasmanian devils and their transmissible cancer. Healthy (top) and DFTD-infected (bottom) Tasmanian devils.


RATIONALE: In principle, phylodynamics should 
be readily extended to the study of slowly 
evolving pathogens with large genomes through 
careful interrogation of genes to identify those 
that are measurably evolving. By testing indi- 
vidual genes for a clocklike signal, these genes 
may then be used for phylodynamic analysis. 
We demonstrate this proof of concept in DFTD. 


RESULTS: We screened >11,000 genes across 
the DFTD genome, identifying 28 that exhibited 
a strong, clocklike signal, and performed the 
first phylodynamic analysis of a genome larger
than that of a bacterium. We demonstrate here, con-
trary to field observations, that DFTD spread
omnidirectionally throughout the epizootic, 
leaving little signal of geographic structuring 
of tumor lineages across Tasmania. Despite 
predictions of devil extinction, we found that 
the effective reproduction number (R_E), a
summary of the rate at which disease spreads,
has declined precipitously after the initial epi-
demic spread of DFTD. Specifically, R_E peaked
at a high of ~3.5 shortly after the discovery of 
DFTD in 1996 and is now ~1 in both extant 
tumor lineages. This is consistent with a shift 
from emergence to endemism. Except for a single 
gene, we found little evidence for convergent 
molecular evolution among tumor lineages. 


CONCLUSION: We have demonstrated that 
phylodynamics can be applied to virtually any 
pathogen. In doing so, we show that through 
careful interrogation of the pathogen genome, 
a measurably evolving set of genes can be 
identified to characterize epidemiological dy- 
namics of nonviral pathogens with large genomes. 
By applying this approach to DFTD, we have 
shown that the disease appears to be transition- 
ing from emergence to endemism. Consistent 
with recent models, our inference that R_E ~1
predicts that coexistence between devils and 
DFTD is a more likely outcome than devil ex- 
tinction. Therefore, our findings present cau- 
tious optimism for the continued survival of 
the iconic Tasmanian devil but emphasize the 
need for evolutionarily informed conservation 
management to ensure their persistence. 


The list of author affiliations is available in the full article online. 

A transmissible cancer shifts from emergence to
endemism in Tasmanian devils

Paul A. Hohenlohe, Andrew Storfer*


Emerging infectious diseases pose one of the greatest threats to human health and biodiversity. 
Phylodynamics is often used to infer epidemiological parameters essential for guiding intervention 
strategies for human viruses such as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).
Here, we applied phylodynamics to elucidate the epidemiological dynamics of Tasmanian devil facial 
tumor disease (DFTD), a fatal, transmissible cancer with a genome thousands of times larger than that 
of any virus. Despite prior predictions of devil extinction, transmission rates have declined precipitously 
from ~3.5 secondary infections per infected individual to ~1 at present. Thus, DFTD appears to be 
transitioning from emergence to endemism, lending hope for the continued survival of the endangered 
Tasmanian devil. More generally, our study demonstrates a new phylodynamic analytical framework that
can be applied to virtually any pathogen.


Emerging infectious diseases (EIDs) threaten
the health of wildlife, livestock, domestic
animals, and humans (1). One of the pri-
mary contributors to species endanger-
ment (2), EIDs have also led or contributed
to notable extinctions, including dozens of 
amphibian species by chytridiomycosis (3), the 
Polynesian tree snail (Partula turgida) by a 
microsporidian infection (4), and 16 spe- 
cies of Hawaiian honeycreepers (Drepanidini, 
Fringillidae) by avian malaria (Plasmodium 
relictum) and avian pox [Poxvirus avium (2)]. 
EIDs also can have profound impacts on so-
ciety; indeed, the emergence of severe acute re-
spiratory syndrome coronavirus 2 (SARS-CoV-2)
has led to socioeconomic consequences that 
will surely last for years to come (5). 

Recently, phylodynamics has emerged as an
invaluable tool for the characterization of the
epidemiological dynamics of such rapidly evolving
pathogens (6, 7). By reconstructing pathogen
phylogenies, phylodynamic analyses elucidate
critically relevant epidemiological parameters
such as the effective reproduction number
(R_E) (8) and the effective number of infections
(N_E) (9). R_E is a generalization of the basic
reproduction number (R_0) in epidemiological
models, or the expected number of secondary
infections from a single infected individual
entering a wholly susceptible population.
Similarly, R_E quantifies transmission in
populations already infected, predicts when pathogen
prevalence will increase (R_E > 1) or decrease
(R_E < 1), and helps to estimate the vaccination
fraction necessary to achieve herd immunity.
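
As a rough illustration of how a reproduction number translates into a prevalence trend and a vaccination target, the sketch below uses the classical threshold 1 - 1/R. That formula assumes homogeneous mixing and a fully protective vaccine, neither of which is stated in the text, and the function names are hypothetical.

```python
def herd_immunity_threshold(r: float) -> float:
    """Immune fraction needed so that each case causes fewer than one new case.

    Uses the textbook relation 1 - 1/R (an assumption of this sketch:
    homogeneous mixing and a perfectly effective vaccine).
    """
    if r <= 1.0:
        # The pathogen is already not spreading; no vaccination is needed.
        return 0.0
    return 1.0 - 1.0 / r


def prevalence_trend(r_e: float) -> str:
    """Qualitative trend implied by the effective reproduction number."""
    if r_e > 1.0:
        return "increasing"
    if r_e < 1.0:
        return "decreasing"
    return "stable"


if __name__ == "__main__":
    for r in (3.5, 1.0, 0.8):
        print(r, prevalence_trend(r), round(herd_immunity_threshold(r), 3))
```

The values 3.5 and 1.0 in the usage loop are taken from the peak and present-day estimates quoted in the abstract; 0.8 is only an illustrative value below the threshold.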

Although phylodynamics has contributed 
substantially to the management of human 
diseases, including the responses to SARS-CoV-2
(10), its application to nonviral pathogens
such as bacteria has been limited by their
slower rate of molecular evolution and larger
genome sizes (11). Past efforts to apply phylodynamic
approaches to pathogens other than
viruses have been impeded by the challenges 
associated with large genome size and the 
identification of a measurably evolving por- 
tion of the pathogen genome with which to 
reconstruct phylogenies. However, the exten- 
sion of phylodynamics to nonviral pathogens 
including many wildlife EIDs would prove 
invaluable for management and intervention. 

A marquee example is Tasmanian devils 
(Sarcophilus harrisii; Fig. 1A), which are en- 
dangered by an unusual class of emerging in- 
fectious disease: a transmissible cancer (12). 
Devil facial tumor disease (DFTD; Fig. 1B) has
spread across 95% of the devil’s geographic 
range since its discovery in 1996, causing 
localized population declines exceeding 90% 
(13) and a species-wide decline of 80% (12, 14). 
DFTD replicates clonally and is transmitted 
as an allograft through biting during social 
interactions (15). Case fatality rates are nearly 
100%, and devil susceptibility appears to be 
largely universal (12) due in part to limited
genetic variation caused by historical popula- 
tion bottlenecks (16). 

Although efforts have been made to de- 
scribe DFTD transmission dynamics early in 
the epizootic [e.g., (17)], little is known about how its
epidemiology has changed since emergence. 
Despite initial model predictions of devil ex- 
tinction resulting from frequency-dependent 
transmission (17), populations persist even in 
long-diseased areas, and some populations 
may even be recovering (14, 18). Clearly, more 
analyses are needed to reconcile the discrep- 
ancy between model predictions and empiri- 
cal observations. 

Herein, we characterized the epidemiological 
history of DFTD using whole-genome sequenc- 
ing of 51 tumor samples selected to maximize 
spatiotemporal variation across Tasmania be- 
tween 2003 and 2018 (fig. S1). By screening 
>11,000 genes distributed across the tumor ge- 
nome, we identified a subset that is measurably 
evolving, thus demonstrating an approach that 
enables the application of a suite of phylody- 
namic methods to virtually any pathogen. We 
also dated the approximate emergence time 
of DFTD, determined the number of lineages 
present, tracked the rate and directionality of 
lineage spread, and estimated N_E and R_E.


Results 
Identification of clocklike genes 


We screened 11,359 total genes across the 
DFTD genome and identified 28 genes (totaling 
431,608 bp) that were sufficiently variable 
(>50 parsimony-informative sites) and evolved
in a sufficiently clocklike manner for use in
tip-dating our phylogeny (table S1). These 
28 genes were distributed widely across the 
genome without any discernible pattern with 
respect to gene function. Once concatenated 
and aligned for the 51 final samples, the aligned 
matrix was composed of 431,608 total columns, 
2520 parsimony-informative sites, 1893 single- 
tons, 802 doubletons, and 2711 variants found 
in three or more individuals (fig. S2). 
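
The two screening criteria named above can be sketched in a few lines. The first function counts parsimony-informative columns (at least two character states, each present in at least two sequences). The second fits a root-to-tip regression of genetic distance on sampling date, which is a common heuristic for clocklike signal and is swapped in here as an assumption; the text does not say which clock test was used. All names and the toy data are hypothetical.

```python
from collections import Counter


def is_parsimony_informative(column: str) -> bool:
    """True if at least two nucleotides each occur in at least two sequences."""
    counts = Counter(base for base in column.upper() if base in "ACGT")
    return sum(1 for n in counts.values() if n >= 2) >= 2


def count_parsimony_informative(alignment: list[str]) -> int:
    """Count informative columns in an alignment of equal-length sequences."""
    return sum(is_parsimony_informative("".join(col)) for col in zip(*alignment))


def root_to_tip_fit(dates: list[float], distances: list[float]) -> tuple[float, float]:
    """Least-squares slope (substitutions/site/year) and R^2 of distance on date."""
    n = len(dates)
    mean_x = sum(dates) / n
    mean_y = sum(distances) / n
    sxx = sum((x - mean_x) ** 2 for x in dates)
    syy = sum((y - mean_y) ** 2 for y in distances)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dates, distances))
    if sxx == 0:
        raise ValueError("all sampling dates are identical")
    slope = sxy / sxx
    r_squared = (sxy * sxy) / (sxx * syy) if syy > 0 else 0.0
    return slope, r_squared


if __name__ == "__main__":
    toy_alignment = ["AACG", "AACT", "ATCG", "ATGG"]  # hypothetical sequences
    print(count_parsimony_informative(toy_alignment))
    print(root_to_tip_fit([2003, 2008, 2013, 2018], [0.001, 0.002, 0.003, 0.004]))
```

Under this sketch, a gene would be retained when its informative-site count exceeds the stated threshold of 50 and its regression shows a positive slope with a reasonably high R^2; the R^2 cutoff is left to the analyst because none is given in the text.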


DFTD phylogeography 

Using these 28 genes, we estimated the time of 
DFTD origination to be between 1977 and 1987 
(mean = 1983.93; Fig. 1C and figs. S3, S7, and 
S8), which is compatible with its discovery in 
1996 (12). Field studies indicate an origin of 
DFTD in northeastern Tasmania, with subse- 
quent southern and westward spread across the 
island (13). However, our phylogeographic re-
construction showed a more complex pattern of
DFTD spread (Fig. 1D and movie S1). We found
evidence for two contemporary, monophyletic
DFTD lineages that emerged early in the epizootic
and completely overlapped in their geographic
distributions (Fig. 1, C and D), consistent with
a recent landscape genetics study (19). After
reaching central Tasmania, the cancer subse-

DFTD phylodynamics


Estimates of R_E through time are consistent
with the discovery of DFTD in 1996; a low R_E
at the time of disease origination would translate
to low prevalence before discovery (Fig. 2A).
The birth-death skyline, which assembles piecewise
constant estimates of R_E, inferred a sudden,
increased rate of transmission in the late 1990s
(Fig. 2A). Specifically, R_E increased from ~1
around 1980 to a maximum of ~3.5. Currently
R_E is <1, suggesting that DFTD will decrease
in prevalence. The N_E (Fig. 2B) supports the
same dynamics, showing a rapid increase in
number of infections when R_E was >1, followed
by a stabilization of the infected population
size coinciding with an R_E of ~1.

The methods used above rely on the assumption
that transmission dynamics do not vary
among tumor lineages. Therefore, we relaxed
this assumption using a multistate birth-death
model to determine whether tumor lineages
differ in their epidemiological dynamics. We
found that, indeed, transmission rate dynamics
were not uniform across tumor lineages
(Fig. 3A and fig. S9), with two detectable shifts
in transmission rate. R_E (as estimated under
the birth-death skyline) declined to just above
or below 1 toward the present in each of the
two, reciprocally monophyletic contemporary
transmission clusters, respectively (Fig. 3B).
Neither transmission cluster was geographically
discrete; each was distributed island wide.

Fig. 1. Spread of DFTD lineages across Tasmania. (A) Healthy devil (Photo: David G. Hamilton).
(B) Tasmanian devil infected with DFTD (Photo: Alexandra K. Fraik). (C) Phylogeny of 51 tumor samples;
branch colors indicate the estimated geographic diffusion rate. Shown are branch lengths in units of time:
the x-axis corresponds to year. (D) The same phylogeny mapped in geographic space. Blue circles
correspond to both terminal and internal nodes, with size and color corresponding to age. Inferred locations
of internal nodes correspond to our sampled lineages; no samples exist at the time and location of
disease origination. Red polygons indicate posterior probability locations of historical infections. Dashed blue
arrows indicate omnidirectional DFTD spread.
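
To make the link between piecewise-constant reproduction numbers and the growth or stabilization of infections concrete, the sketch below propagates the expected number of infections through successive intervals. It assumes the usual birth-death parameterization in which the reproduction number is the transmission rate divided by the rate of becoming uninfectious (here called delta), so that the expected growth rate is (R_E - 1) * delta. The value of delta, the interval lengths, and the function name are hypothetical, and this deterministic expectation is not the Bayesian inference performed in the study.

```python
import math


def expected_infections(i0: float,
                        intervals: list[tuple[float, float]],
                        delta: float) -> list[float]:
    """Expected infected count at the end of each interval.

    i0        -- starting number of infections
    intervals -- (duration_in_years, r_e) pairs, applied in order
    delta     -- rate of becoming uninfectious per year (hypothetical value)
    """
    trajectory = [i0]
    current = i0
    for duration, r_e in intervals:
        # Growth rate is transmission minus removal: (R_E - 1) * delta.
        current *= math.exp((r_e - 1.0) * delta * duration)
        trajectory.append(current)
    return trajectory


if __name__ == "__main__":
    # Illustrative regimes only: slow start, rapid spread, then R_E near 1.
    regimes = [(10.0, 1.0), (5.0, 3.5), (10.0, 1.0)]
    print(expected_infections(1.0, regimes, delta=0.5))
```

With R_E equal to 1 the expected count stays flat, with R_E above 1 it grows exponentially, and with R_E below 1 it shrinks, which is the qualitative pattern described for R_E and N_E above.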







Genomic differentiation among transmission clusters 


Our results show that accelerated transmission
rates of DFTD began in the early to mid-1990s
(Figs. 2A and 3B). With our data, we were unable
to unequivocally determine what specifically led
to this change. However, we screened the 51 tumor
genomes and identified 791 unique variants that
differentiated the three identified transmission
clusters found in the top 0.1% of pFST (a measure
of genomic differentiation) values. Of these
variants, 687 were intergenic and the remaining
104 fell within a total of 68 unique genes and
70 unique transcripts (fig. S10).

These genes are associated with regulation
of (i) STAT3 (NFATC3 and PRKG1), a key DFTD
immune evasion protein (20); (ii) Schwann
cell differentiation, the cell type of DFTD origin
(21); and (iii) Wnt signaling (ARMC8, CDH17,
DMX1L1, and PKP2), a key process in cancer
progression (22) and identified across previous
DFTD genomic studies (23). The above
genes were found to be highly differentiated
between clusters 1 (black; Fig. 3A) and
3 (green), as well as between clusters 2 (red)
and 3, but not between clusters 1 and 2.

Fig. 2. Epidemiological dynamics of DFTD. Red arrows indicate the year of DFTD discovery (1996). Shading
indicates the 95% credible intervals for parameter estimates; thick black lines are medians of the posterior
distribution. (A) R_E through time under the birth-death skyline model. Dashed gray line indicates R_E = 1. Above this
line, disease spreads; below it, the number of infections decreases. (B) N_E under the coalescent Bayesian skyline.

Fig. 3. Heterogeneous transmission rate dynamics in DFTD. (A) Phylogeny inferred under the birth-death
skyline model. Branch colors show three distinct transmission rate regimes identified under the multistate
birth-death model. Circles show posterior probability of interior nodes. Vertical dashed gray line
indicates year of disease detection (1996). (B) Transmission rates through time as inferred under the
birth-death skyline model for clusters 2 (red) and 3 (green). Median estimates are bold black lines; the 95%
credible interval is shaded. Red arrows indicate the year of DFTD discovery (1996). Inset map shows the
distribution of samples belonging to each transmission cluster, with points colored the same as in (A).
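
The outlier screen in this section keeps the most differentiated sites and then separates genic from intergenic hits. A minimal sketch of that selection step follows. It takes per-site differentiation scores as given and does not compute pFST itself; the site identifiers, the annotation set, and both function names are hypothetical.

```python
import math


def top_fraction(scores: dict[str, float], fraction: float = 0.001) -> list[str]:
    """Return the site IDs whose scores fall in the top `fraction` of all sites."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    # Always keep at least one site so that small inputs still return a result.
    keep = max(1, math.ceil(len(scores) * fraction))
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:keep]


def split_by_annotation(sites: list[str],
                        genic_sites: set[str]) -> tuple[list[str], list[str]]:
    """Partition outlier sites into (genic, intergenic) using a given annotation."""
    genic = [s for s in sites if s in genic_sites]
    intergenic = [s for s in sites if s not in genic_sites]
    return genic, intergenic


if __name__ == "__main__":
    # Toy scores: 2000 sites, so the top 0.1% corresponds to two sites.
    toy_scores = {f"site{i}": float(i) for i in range(2000)}
    outliers = top_fraction(toy_scores, 0.001)
    print(outliers, split_by_annotation(outliers, {"site1999"}))
```

Applied to real data, the same partition would reproduce the kind of tally reported above (791 outlier variants, of which 687 were intergenic and 104 genic), given the study's scores and gene annotation.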


Discussion 


Herein, we have demonstrated that the phy- 
lodynamic analytical framework, although 
previously applied exclusively to viruses and 
a few bacteria, can be used to assess more 
slowly evolving pathogens with larger ge- 
nomes. Through careful assessment of clock- 
like evolution across the genome, we were 
able to extract a measurably evolving signal 
across 28 genes totaling 431.6 kb. Note that 
this amount of sequence is far greater than 
that used in phylodynamic studies of viruses, 
including Ebola [~19.9 kb (24)], influenza A 
[13.6 kb (25)], and SARS-CoV-2 [~29.8 kb (26)].
Therefore, our approach can be readily ap- 
plied to the study of other nonviral patho- 
gens that previously fell outside of the scope 
of phylodynamic study. When applied to devil 
facial tumor disease, our analyses show (i) that
DFTD originated well before its discovery in
1996; (ii) no geographic substructuring between the
two extant tumor lineages; (iii) omnidirectional
spread of tumors; and (iv) a precipitous decline
in transmission to replacement at present, in-
dicating a shift to endemism.

Our study indicates that DFTD may have 
originated nearly a decade or more before its 
discovery in 1996. These findings are com- 
patible with field observations; upon initial 
discovery in northeastern Tasmania, tumors 
were large and widespread in the population 
(12), so the disease had likely been circulat- 
ing for some time. Nonetheless, it is possible 
that some host variants persisted in our multi- 
ple sequence alignments (MSAs) despite our 
best efforts to remove potential contamination 
(see the materials and methods). Because devil 
genomic variants will coalesce earlier than 
DFTD variants, a consequence of such contam- 
ination is that our inferred dates of disease 
origination are likely slightly earlier than the 
true origination date. 

One surprising finding of our phylogeo- 
graphic analysis was the apparent lack of 
geographical structuring of DFTD lineages. 
Rather, our results indicate that the disease 
spread omnidirectionally throughout the epi- 
zootic, repeatedly recolonizing previously infected 
populations that experienced substantial pop-
ulation declines [e.g., (13, 14)]. That is, as devil
populations reached low densities, they likely
received infected migrant devils from neighbor-
ing areas, which is compatible with the observed
disease-induced metapopulation dynamics (27) 
and local patterns of DFTD lineage replacement 
(28). Thus, our phylogeographic analysis chal-
lenges the conventional narrative of an east-to-
west disease wave emanating from northeastern
Tasmania (12) and instead suggests continuous
spread in all cardinal directions (Fig. 1D). 

These results differ slightly from those of 
Murchison et al. (21), who recovered evidence 
of fine-scale geographic structuring within the
Forestier peninsula of southeastern Tasmania.
Whereas we sampled 51 tumors across Tasma- 
nia between 2003 and 2018, 36 of the 68 tumor 
samples in Murchison et al. (21) came from 
the Forestier peninsula over a 4-year period. 
Additionally, the tumor phylogeny inferred 
by Murchison et al. (21) was based on 16 nuclear 
and 21 mitochondrial variants that were pre- 
sumably somatic. By contrast, our phylogenies 
are based on 5406 total variants distributed 
across 28 measurably evolving genes. 

Terminal branches of our inferred phylog- 
enies are longer than internal branches, dif- 
fering slightly from typical viral phylogenies. 
High within-host tumor diversity is one poten- 
tial explanation, but evidence of this is limited 
and preliminary. The only study that com- 
pared within-host tumor variation found that 
only six tumors from 20 individuals could be 
distinguished. Furthermore, three of these six 
differed by only a single variant, thus demon- 
strating limited within-host variation (27) and 
the potential for superinfection by more than 
one tumor lineage. 

An alternative explanation for long terminal 
branches comes from the widespread geo- 
graphic distribution and complete geographic 
overlap of tumor lineages, which were also 
seen in (19). That is, because of the low-density 
sampling scheme of our study with respect 
to geographic and temporal distribution of 
samples, the probability of sampling two close- 
ly related samples at any time or place is low. 

Although we infer a geographical origin in 
north-central Tasmania, this is likely due to 
the earliest DFTD samples being collected 
from this location. Note that this is not to be 
interpreted as the location at which DFTD 
originated because it is much more probable 
that the disease originated on the eastern 
coast, near the site of discovery (12). Our infe- 
rence of the root location in central Tasmania 
is likely a consequence of the absence of sam- 
ples collected at the general location and time 
of disease origin in northeastern Tasmania. 
Tissue samples of tumors were not collected 
until the early 2000s, so no samples exist from 
the time and location of disease origin. Further, 
the earliest available samples are from central 
Tasmania, not the site of disease origination. In 
turn, we view the reconstructed geographic 
root state as being the location of the most 
recent common ancestor of our samples rather 
than the location of disease origin. 

We found three transmission clusters that
differ in their epidemiological dynamics and
have identified a number of genes that may
contribute to the observed transmission rate
variation. The absence of highly differentiated
sites among clusters 1 and 2 could be caused
by the minimal change in transmission rate at
the time of origination of cluster 2 (Fig. 3B).
These genes may thus contribute to the observed
transmission rate variation and are candidates
in need of further functional validation
in vitro (Fig. 4 and tables S4 and S5).

Fig. 4. Manhattan plot illustrating differentiation of SNPs in pairwise comparisons of transmission clusters using pFST. The top 0.1% most differentiated
sites are shown in blue, and the top 0.01% most differentiated are shown in red. Select highly differentiated variants including those of functional importance are
annotated. DFTD-associated genes discussed in the text are annotated.

Molecular evolution among DFTD trans- 
mission clusters could occur in two ways. In 
the first, convergent molecular evolution would 
be evidenced by the same genes becoming 
differentiated in each derived cluster (Fig. 3A). 
Alternatively, evolution could result in the dif- 
ferentiation of unique genes in each cluster. In 
general, each derived transmission cluster har- 
bors a largely unique set of candidate SNPs, 
implicating unique mechanisms by which 
transmission rates vary. We only found a single 
gene (NAALADL2) consistent with the former 
(convergent) pattern of molecular evolution 
between clusters 2 and 3 (fig. S10 and table 
S4). This gene represents the only evidence 
recovered for potential convergent molecular 
evolution and may be important with respect 
to the oncogenicity of DFTD. NAALADL2 has 
been demonstrated to be overexpressed in 
colon and prostate tumors compared with 
benign tumors in humans (29), and it promotes 
a tumor phenotype that exhibits greater capac- 
ity to migrate and metastasize. Future study 
of this gene in the context of DFTD evolution 
is thus warranted. Nonetheless, the observed 
transmission rate declines could be due to 
the accumulation of mutation load, which 
has been found in the canine transmissible 
cancer (30). Additionally, study of patterns of
molecular evolution in the 28 clocklike genes 
identified in our study presents an exciting 
avenue of research. 

Contrary to expectations of Tasmanian devil 
extinction (13, 17), our results suggest that 
DFTD is transitioning from emergence to 
endemism. The large decline of R_E in the two
pandemic tumor lineages suggests either co-
existence or DFTD extinction, a result consist-
ently supported by recent models and field data
(27, 31). This decrease in R_E may be a conse-
quence of devil population declines, leading to
fewer transmission opportunities. Current low
R_E values are consistent with a reduced force
of infection estimated using mark-recapture 
field data (32) and possible demographic recov- 
ery of some populations (33). Contemporary 
transmission dynamics may result in either 
the long-term coexistence of DFTD and devils 
or DFTD extinction, outcomes predicted by 
individual-based models (32). However, the 
potential for coexistence does not imply recov- 
ery of devils to pre-DFTD population sizes; cur- 
rent models predict persistent, substantially 
reduced devil densities (31).

Our results support the growing body of 
evidence that if DFTD continues to progress 
naturally, then devil extinction is unlikely. 
Therefore, we urge caution in the considera- 
tion of introductions of captive-bred devils to 
infected populations, a practice that has already 
been set into action (34). If reduced devil 
densities are contributing to reduced trans- 
mission rates (27), then an artificial increase 
of population densities through such intro-
ductions may be unwise. Further, introduction
of disease-naive devils may swamp adaptation 
to this new selective pressure (35). Instead, the 
preferred management option may be to allow 
natural evolution to occur and only introduce 
captive devils as a last resort if demographic 
rescue is urgently needed. 

The existence of two widespread transmis- 
sion clusters with unique epidemiological dy- 
namics also suggests that mitigation strategies 
such as an oral bait vaccine (36) should ac- 
count for this diversity. Unfortunately, a second, 
independently evolving transmissible cancer, 
DFT2, has been discovered in southeastern 
Tasmania (37) and may become a growing 
threat. Transmissible tumors may thus be com- 
monplace throughout the evolutionary history 
of devils, but initial studies do not support this 
conclusion (38). Comparative phylodynamics 
of DFT2 and DFTD will be essential to under- 
stand their relative transmission dynamics. 
Our results suggest cautious optimism for the 
continued survival of the iconic Tasmanian 
devil but emphasize the need for evolutionar- 
ily informed management practices to ensure 
its persistence (18).


Materials and Methods 
Sample collection 


Tumor biopsies were not collected from 
Tasmanian devils until the early 2000s, cor- 
responding to the time of our earliest samples. 
Samples from the University of Tasmania col- 
lection were obtained under University of 
Tasmania ethics approval A13326 and Wash- 
ington State University institutional animal 
care and use committee approval ASAF 6796.
Tumor tissue biopsies were obtained from wild
devils using a 3-mm biopsy punch, following
detailed field trapping protocols that have been
previously described (12, 39).

Early sampling efforts across Tasmania have 
historically been heterogeneous, with much 
of the effort being focused on collection at 
disease fronts. Therefore, we made a concerted 
effort to incorporate samples that maximized 
the available temporal range (2003 to 2018) and 
geographic distribution of samples available 
in the two largest sample repositories at the 
University of Tasmania and the Tasmanian 
Department of Primary Industries, Parks, Water
and Environment, Tasmania, Australia. Where
possible, samples were obtained such that 
the dataset represented the geographical dis- 
tribution of DFTD in Tasmania through time, 
with the caveat that samples from eastern 
Tasmania are unavailable earliest in the col- 
lections, and collection has ceased at a number 
of sites in central Tasmania. Further, for devils 
that had tumor biopsies taken multiple times, 
we selected tumor biopsies that were associ- 
ated with the first tumor biopsy for that devil. 

Thus, in 2018 we obtained 50 tumor sam- 
ples spread both geographically and tem- 
porally (2003 to 2018; fig. S1) across Tasmania. 
Whole-genomic libraries were prepared using 
NEBNext (New England Biolabs, Ipswich,
MA, USA) kits and sequenced on 26 lanes 
of a 10x Illumina Platform at Northwestern 
Genomics (Seattle, WA). An additional six 
whole-genome tumor samples originally sequenced
for another study (23) were also included
(BioProject PRJNA472767; BioSamples
SAMN09242213, SAMN09242220, SAMN09242229,
SAMN09242223, SAMN09242224, and
SAMN09242226).


Sequencing read assembly 


Raw reads were merged using flash2 (40) and 
adaptors trimmed with sickle (41). Trimmed
reads were then aligned to the reference ge- 
nome of S. harrisii (21) using bwa mem (42). 
Bam files produced from this step were sub- 
sequently sorted using Samtools (v1.9) sort (43). 
Samtools merge was then used to combine the 
sorted paired-end and single-end bam files 
for each sample; these combined bams were 
subsequently merged across lanes. Polymerase 
chain reaction duplicates were removed using 
Picard MarkDuplicates (44), and final bam files
were indexed using Samtools index.


Phylogenetic sequence preparation 


To date, most phylodynamic studies have been 
conducted on extremely rapidly evolving viruses. 
Given that DFTD evolves much more slowly, we 
needed to ensure that the loci used to gen- 
erate our phylogeny were measurably evolv- 
ing. To accomplish this, we used an iterative 
approach that first identified a candidate set 
of genes evolving in a clocklike manner in
both devils and tumors. 

We first generated consensus sequences for 
the 22,391 annotated genes for the Tasmanian 
devil genome (Devil_ref v7.0, INSDC Assembly 
GCA_000189315.1). We generated consensus 
sequences for 48 of the 56 tumor isolates and 
14 devil individuals using Samtools faidx. We 
then generated individual MSAs for 11,359 of 
the 22,391 genes using Clustal Omega (45). We 
only generated ~50% of the alignments be- 
cause of computational constraints; these 
alignments took ~3 months to finish. How- 
ever, the aligned genes were distributed across 
chromosomes 1 through 6, as well as the
X chromosome, so we are not concerned 
that we are including a biased sample of the 
genome. We then summarized each alignment 
using AMAS (46), which quantifies the num- 
ber of variable and parsimony-informative 
sites per gene. 

Using these aligned genes, we subsequently 
inferred individual gene trees using IQ-TREE 
(47). Specifically, we fitted models of nucleo- 
tide substitution using the -m TEST flag, and 
the best-fit model was subsequently used for 
tree inference. We then tested, using each of 
the 11,359 gene trees, for a clocklike substi- 
tution process using the scripts provided by 
Murray et al. (48). These scripts regress phylo- 
genetic root-to-tip distance against sampling 
date in a manner analogous to TempEst (49) 
to find the root branch that maximizes the fit 
of the regression. Significance was assessed by 
randomly permuting sampling dates across the 
tips 500 times, with the correlation coefficient 
as the test statistic. 
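
As an illustration only, the date-randomization step can be sketched in Python as follows. This is not the script of Murray et al. (48): the function name `clock_permutation_test` and its arguments are hypothetical, the root is assumed to be fixed in advance (the published scripts also search for the best-fitting root branch), and a one-sided test for a positive correlation is assumed.

```python
import numpy as np


def clock_permutation_test(root_to_tip, dates, n_perm=500, seed=0):
    """Regress root-to-tip distance on sampling date and test the fit by
    permuting sampling dates across tips (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    x = np.asarray(dates, dtype=float)
    y = np.asarray(root_to_tip, dtype=float)

    # Observed test statistic: Pearson correlation of date vs. divergence.
    r_obs = np.corrcoef(x, y)[0, 1]

    # Null distribution: shuffle the dates among the tips n_perm times.
    r_null = np.array(
        [np.corrcoef(rng.permutation(x), y)[0, 1] for _ in range(n_perm)]
    )

    # One-sided P value; the +1 terms count the observed arrangement itself.
    p_value = (1 + np.sum(r_null >= r_obs)) / (n_perm + 1)

    # Slope of the regression = implied substitution rate per unit time.
    slope, intercept = np.polyfit(x, y, 1)
    return r_obs, p_value, slope, intercept
```

Under these assumptions, a gene would pass this part of the filter when the returned P value is below 0.05, and the fitted slope is the substitution rate implied by the regression.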

We sought to identify a candidate set of 
genes that both exhibited a significant and 
realistic clocklike substitution process and 
harbored a sufficient number of phylogenet- 
ically informative sites to resolve the phylo- 
geny. Therefore, we filtered these 11,359 genes 
such that only 364 were found to be evolving 
in a significant, clocklike manner (P < 0.05), 
harboring ≥50 parsimony-informative sites, and with time
to most recent common ancestor (MRCA) 
being inferred as <200 years before present 
(ybp). From this point forward, an additional 
eight tumor samples were included into our 
analyses, bringing the total number of sampled 
tumors to 56. These eight samples were pre- 
viously excluded because of spontaneous regres- 
sion [see (23)] but are included herein because 
we are specifically interested in broadscale 
patterns of transmission across Tasmania. 
The phylogenetic positions of these samples 
are indicated in fig. S3. 

Next, we sought to remove single-nucleo- 
tide polymorphisms (SNPs) present in both 
devils and tumors, which would be indicative 
of potential host contamination in the tumor 
sample. To accomplish this, we called SNPs 
and indels for all 56 tumor samples using 
bcftools mpileup and bcftools call -mv. Indels
were then normalized using bcftools norm -m. 
We then compared these SNPs and indels with 
those identified in the available 12 Tasmanian 
devil high-coverage (~30x) whole-genome 
sequences from Margres et al. (35) using 
bcftools isec to identify which SNPs and indels 
were unique to the tumor samples. We filtered 
out all variants shared between devils and 
tumors, leaving putatively somatic tumor va- 
riants that are not found as standing genetic 
variation in devil populations. This approach 
has been successfully applied to the study of 
DFTD recently (50) and is analogous to the 
approach used to study somatic variants in 
canine transmissible venereal tumor (51).
Per-sample consensus sequences were then called
using bcftools consensus in the manner de- 
scribed above. 

We then inferred a final set of MSAs of all 
56 tumor samples using MAFFT (52). Specifically,
we ran MAFFT using the following
settings: --localpair --maxiterate 1000, which select the
accurate L-INS-i alignment method. We then
repeated the workflow outlined above to infer 
gene trees using IQ-TREE, to test for clocklike 
signal, and to summarize alignments. As above, 
we subsequently filtered resultant MSAs such 
that only 28 genes were retained that were 
evolving in a significant (P < 0.05) clocklike 
manner, harboring >50 parsimony-informative 
sites, and exhibiting an MRCA < 200 ybp. 


Phylodynamic inference 


To expedite tree inference in our Bayesian 
phylodynamic analyses, we inferred a starting 
tree using concatenated sequences after auto- 
mated model selection with IQ-TREE. We 
leveraged three complementary Bayesian phylodynamic
methods in BEAST v2.5.1 (53): Bayesian
continuous phylogeography [visualized in
SpreaD3 (54, 55)], coalescent Bayesian skyline
(9), and birth-death skyline (8) to: (i) charac- 
terize the phylogeographic history of DFTD 
as it spread across Tasmania, (ii) infer the 
demographic history of the disease, and (iii) 
quantify the rates of transmission through- 
out the epizootic. To test for among-lineage 
variation in transmission rates, we applied 
the multistate birth-death model (56), as im- 
plemented in R v3.6.1 (57), to the birth-death 
skyline maximum clade consensus tree. To 
quantify transmission rate variation in each 
cluster, we used the birth-death skyline model 
for the samples within each cluster. To prevent 
overfitting, we reduced the number of time-series
estimates of R_E per cluster to five. We additionally
used TreeTime (58) to conduct a root-to-tip
regression of molecular divergence in
the ML tree against time of sampling to ob- 
tain an estimate of the time of disease origin- 
ation. Details specific to each analysis are 
described in the extended materials and meth- 
ods in the supplementary materials. 
Phylogenetic inference: Starting tree

As before, we implemented model selection in 
IQ-TREE on the concatenated sequence before 
topology inference. To quantify topological 
uncertainty, we conducted 100 rounds of 
bootstrapping. The ML tree was then read 
into TempEst to identify problematic sequen- 
ces. We identified and removed five tumors 
that had outlying residuals or appeared to 
have mislabeled tip-dates. These five samples 
were excluded from all downstream analyses. 
In turn, we generated our final set of 51 tumor 
samples presented in the main text. After re- 
moval of these tumors, the inferred topology 
was subsequently visualized using FigTree (59). 


Bayesian phylodynamic inference 


For each of the Bayesian phylodynamic analy- 
ses described above, the best-fit model of 
nucleotide evolution as determined by IQ-TREE 
for the concatenated sequence was set in 
BEAST. Nondefault priors are shown in table 
S2. Four independent Markov chain Monte 
Carlo (MCMC) algorithms were run for 50 mil- 
lion generations, sampling every 1000 gener- 
ations. Individual MCMC traces were compared 
to assess convergence using Tracer. Convergent 
chains were combined using LogCombiner, dis- 
carding the first 20% of samples for each chain 
as burn-in. Last, a single chain of 50 million 
generations was run, sampling from the prior 
only; these MCMC traces were used to assess 
deviation of the posterior from the prior for 
each estimated parameter. Maximum clade 
consensus trees from combined posterior dis- 
tributions for each model were obtained using 
TreeAnnotator and visualized in FigTree.
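
The number of posterior samples implied by these settings can be checked with a short calculation. The sketch below is illustrative; the helper `retained_samples` is hypothetical, and it assumes that exactly one state is logged per sampling interval (the initial state is ignored) and that all four chains converged and were combined.

```python
def retained_samples(chain_length, sample_every, burnin_percent, n_chains):
    """Count logged, post-burn-in, and combined samples (hypothetical helper)."""
    per_chain = chain_length // sample_every        # samples logged per chain
    burnin = per_chain * burnin_percent // 100      # samples discarded per chain
    kept = per_chain - burnin                       # samples retained per chain
    return per_chain, kept, kept * n_chains


# Settings stated in the text: 50 million generations, sampling every 1000
# generations, 20% burn-in, four independent chains.
per_chain, kept, combined = retained_samples(50_000_000, 1000, 20, 4)
print(per_chain, kept, combined)  # prints: 50000 40000 160000
```

Under these assumptions, each chain contributes 40,000 samples, for 160,000 samples in the combined posterior.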


Genomic differentiation among 
transmission clusters 


To characterize genetic differences among 
transmission clusters identified under the 
multistate birth-death model, we identified
SNPs exhibiting particularly strong differentiation
among clusters using the pFst association test
implemented in the GPAT++ software package (60, 61).
We interpret these SNPs as candidate genomic
variants that could explain the observed transmission
rate variation. The pFst association test
conducts a likelihood ratio test of allele fre- 
quency differences among populations, correct- 
ing for sequencing error using genotype 
likelihood scores in the calculation of model 
parameters. This approach has previously been 
successfully applied to the study of DFTD (50). 
To ensure that only confidently identified SNPs 
were retained in this analysis, we filtered the 
dataset using VCFtools v0.1.16 (62) with the
following flags: --mac 4, --max-alleles 2, --min-alleles 2,
--minDP 10, --max-missing 0.8, and --remove-indels.
Specifically, we made all pairwise
comparisons among identified transmission
clusters and/or regimes. We subsequently
characterized and extracted annotations from
Ensembl for differentiated SNPs using the
Variant Effect Predictor (63).
INTRODUCTION: Determining the structures of
protein complexes is crucial for understanding 
cellular functions. Here, we describe an inte- 
grative structure determination approach that 
relies on in vivo quantitative measurements of 
genetic interactions. Genetic interactions report 
on how the effect of one mutation is altered by 
the presence of a second mutation and have 
proven effective for identifying groups of genes 
or residues that function in the same pathway. 
The point mutant epistatic miniarray profile 
(pE-MAP) platform allows for rapid measure- 
ment of genetic interactions between sets of 
point mutations and deletion libraries. A pE- 
MAP is made up of phenotypic profiles, each 
of which contains all genetic interactions be- 
tween a single point mutant and the entire 
deletion library. 


RATIONALE: We observe a statistical association
between the distance spanned by two
mutated residues in a protein complex and
the similarity of their phenotypic profiles
(phenotypic similarity) in a pE-MAP. This
observation is in agreement with the expecta- 
tion that mutations within the same func- 
tional region (e.g., active, allosteric, and binding 
sites) are likely to share more similar phenotypes 
than those that are distant in space. Here, we 
explore how to use these associations for de- 
termining in vivo structures of protein com- 
plexes using integrative modeling. 


RESULTS: We generated a large pE-MAP by 
crossing 350 mutations in yeast histones H3 
and H4 against 1370 gene deletions (or hypo- 
morphic alleles of essential genes). The phe- 
notypic similarities were then used to generate 
spatial restraints for integrative modeling of 
the H3-H4 complex structure. The resulting 
ensemble of H3-H4 configurations is accurate 
and precise, as evidenced by its close similarity 
to the crystal structure. This finding indicates 
the utility of the pE-MAP data for integrative 
structure determination. Furthermore, we show 
that the pE-MAP provides a wealth of biological 





In vivo structure determination using genetic interactions. pE-MAPs are generated by measuring the 
growth of yeast colonies (left) and visualized as a heatmap (background). We present an application of 
pE-MAPs to determine protein complex structures, using integrative modeling, and apply it to histones H3 
and H4 (right) and other complexes. H3 (purple) and H4 (teal) are highlighted in the context of the 
nucleosome [gray, modified Protein Data Bank (PDB) 1ID3]. 
insight into the function of the nucleosome and
can connect individual histone residues and 
regions to associated complexes and processes. 
For example, we observe very high phenotypic 
similarities between modifiable histone resi- 
dues and their cognate enzymes, such as H3K4 
and COMPASS, or H3K36 and members of 
the Set2 pathway. Furthermore, the pE-MAP 
reveals several residues involved in DNA repair 
and others that function in cryptic transcription. 

We demonstrate that the approach is trans- 
ferable to other complexes and other types 
of phenotypic profiles by determining the 
structures of two complexes of known structure:
(i) subunits Rpb1 and Rpb2 of yeast RNA
polymerase II, using a pE-MAP of 53 point 
mutants crossed against 1200 deletions and 
hypomorphic alleles; and (ii) subunits RpoB 
and RpoC of bacterial RNA polymerase, using 
a chemical genetics map of 44 point mutants 
subjected to 83 environmental stresses. The 
accuracy and precision of the models are compa- 
rable to those based on chemical cross-linking, 
which is commonly used to determine protein 
complex structures. Moreover, the accuracy 
and precision improve when using pE-MAP 
and cross-linking data together, indicating 
complementarity between these methods and 
demonstrating a premise of integrative struc- 
ture determination. 


CONCLUSION: We show that the architectures 
of protein complexes can be determined using 
quantitative genetic interaction maps. Because 
pE-MAPs contain purely phenotypic measure- 
ments, collected in living cells, they generate 
spatial restraints that are orthogonal to other
commonly used data for integrative modeling. 
The pE-MAP data may also enable the charact- 
erization of complexes that are difficult to 
isolate and purify, or those that are only tran- 
siently stable. Recent advances in CRISPR- 
Cas9 genome editing provide a means for 
extending our platform to human cells, al- 
lowing for identification and characterization 
of functionally relevant structural changes 
that take place in disease alleles. Expanding 
this analysis to look at structural changes in 
host-pathogen complexes and how they affect 
infection will also be feasible by introducing 
specific mutations into the pathogenic genome 
and studying the phenotypic consequences 
using genetic interaction profiling of relevant 
host genes. 
Determining structures of protein complexes is crucial for understanding cellular functions. Here, we
describe an integrative structure determination approach that relies on in vivo measurements of genetic 
interactions. We construct phenotypic profiles for point mutations crossed against gene deletions or 
exposed to environmental perturbations, followed by converting similarities between two profiles into an 
upper bound on the distance between the mutated residues. We determine the structure of the yeast 
histone H3-H4 complex based on ~500,000 genetic interactions of 350 mutants. We then apply the 
method to subunits Rpb1-Rpb2 of yeast RNA polymerase II and subunits RpoB-RpoC of bacterial RNA 
polymerase. The accuracy is comparable to that based on chemical cross-links; using restraints from 
both genetic interactions and cross-links further improves model accuracy and precision. The approach 
provides an efficient means to augment integrative structure determination with in vivo observations. 


A mechanistic understanding of cellular
functions requires structural characterization
of the corresponding macromolecular
assemblies (1). Traditional

structural biology methods—such as 
x-ray crystallography, nuclear magnetic res- 
onance (NMR) spectroscopy, and electron 
microscopy (EM)—rely on purified samples and 
are generally not applicable to heterogeneous 
samples, such as those of large, membrane- 
bound, or transient assemblies (2). Moreover, 
these methods do not determine the structures 
in their native environments, therefore in- 
creasing the risk of producing structures in 
nonfunctional states or missing relevant func- 
tional states. 

Integrative structure determination has 
emerged as a powerful approach for determin- 
ing the structures of biological assemblies (3). 
The motivation is that any system can be des- 
cribed most accurately, precisely, completely, 
and efficiently by using all available informa- 
tion about it, including varied experimental 
data (e.g., chemical cross-links, protein inter- 
action data, small-angle x-ray scattering pro- 
files) and prior models (e.g., atomic structures 
of the subunits). Integrative methods can often 
tackle protein assemblies that are difficult to 
characterize using traditional structural biology 
methods alone (1, 4-10). Spatial data generated 
by in vivo methods are especially useful for 
integrative structure determination (11). Therefore,
high-throughput in vivo methods are
needed to supplement low-throughput in vivo 
methods, such as single-molecule Förster res- 
onance energy transfer spectroscopy (12). 
Here, we describe how integrative structure
modeling can benefit from spatial restraints 
derived from in vivo quantitative measure- 
ments of genetic interactions. A genetic inter- 
action between two mutations occurs when 
the effect of one mutation is altered by the 
presence of the second mutation (Fig. 1A) (13).
Positive genetic interactions (epistasis or sup- 
pression) arise when the double mutant is 
healthier than expected, whereas negative in- 
teractions (synthetic sickness) arise in rela- 
tionships where the double mutant is sicker 
than expected. Single genetic interactions can 
often be difficult to interpret in isolation. A 
phenotypic profile, defined as a set of genetic 
interactions between a given mutation (e.g., 
a point mutation) and a library of secondary 
mutations (e.g., gene deletions), can be more 
informative (Fig. 1B) (14). A point mutant 
epistatic miniarray profile (pE-MAP) is composed
of such phenotypic profiles for all
mutations in the analysis (Fig. 1C) (15). We 
have previously found a statistical association 
between the distance between two mutated 
residues in the wild-type (WT) structure and 
the similarity between their phenotypic pro- 
files (i.e., phenotypic similarity) (15, 16) (Fig. 
1D). This observation is in agreement with the 
expectation that mutations within the same 
functional region (e.g., active, allosteric, and 
binding sites) are likely to share more similar 
phenotypes than those that are distant in space 
(17-19). Here, we explore how to use these as- 
sociations for determining in vivo structures 
of macromolecular assemblies using integra- 
tive modeling (Fig. 1E). To enable this analysis, 
we generated a large pE-MAP, by designing a
comprehensive set of 350 mutations in histo- 
nes H3 and H4 and crossing these against 1370 
gene deletions (or hypomorphic alleles for es- 
sential genes). We describe this pE-MAP and 
illustrate integrative structure determination 
by its application to three complexes of known 
structure: (i) the yeast histones H3 and H4; (ii) 
subunits Rpb1 and Rpb2 of yeast RNA polymerase
II (RNAPII), using a pE-MAP dataset
of 53 point mutants crossed against a library 
of 1200 deletions and hypomorphic alleles 
(15); and (iii) subunits RpoB and RpoC of bac- 
terial RNA polymerase (RNAP), using a chem- 
ical genetics miniarray profile (CG-MAP), where 
44 point mutants were subjected to 83 different 
environmental stresses (e.g., treatments with 
chemicals and temperature shocks) (20). 
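
To make the definition of a genetic interaction concrete, the following Python sketch scores an interaction as the deviation of the observed double-mutant fitness from the multiplicative expectation of the two single mutants. It is a simplified illustration rather than the scoring scheme used in this study; the function names, the fitness values (expressed relative to WT = 1), and the neutrality threshold `tol` are assumptions chosen for the example.

```python
def interaction_score(w_a, w_b, w_ab):
    """Observed double-mutant fitness minus the multiplicative expectation.

    w_a, w_b: single-mutant fitness relative to wild type (WT = 1).
    w_ab: observed double-mutant fitness relative to wild type.
    """
    expected = w_a * w_b
    return w_ab - expected


def classify(score, tol=0.05):
    """Label an interaction; tol is an arbitrary illustrative threshold."""
    if score > tol:
        return "positive"   # double mutant healthier than expected
    if score < -tol:
        return "negative"   # double mutant sicker than expected
    return "neutral"


def phenotypic_profile(w_point, deletion_fitness, double_fitness):
    """Scores of one point mutant against every deletion in a library."""
    return [
        interaction_score(w_point, w_del, w_double)
        for w_del, w_double in zip(deletion_fitness, double_fitness)
    ]
```

For example, with single-mutant fitness values of 0.8 and 0.5, the expected double-mutant fitness is 0.4; an observed value of 0.1 would be scored as a negative interaction and an observed value of 0.8 as a positive one.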


A comprehensive pE-MAP of histones H3 and H4 


Histones are central to chromatin structure 
and dynamics because they make up the core 
of the nucleosome, the fundamental repeating 
unit of chromatin. The state of the nucleosome 
Fig. 1. Building spatial restraints from pairwise genetic perturbations.
(A) Genetic interactions arise when the combined fitness defect of a double 
mutant deviates from the expected multiplicative growth defect of the two 
single mutants. (B) The generation of a pE-MAP relies on a collection of point 
mutations, which is constructed by systematic mutagenesis of genes that 
encode the subunits of a macromolecular assembly (mutations labeled 

1 to 4). The point-mutant strains are then crossed against a library of gene 
deletions, followed by fitness measurement and subsequent calculation of 


is controlled by histone posttranslational 
modifications (PTMs) (21)—including acetylation,
methylation, phosphorylation, and
ubiquitination—that help maintain and regu- 
late chromatin structure and transcription. Our 
library of point mutations in the core histones 
H3 and H4 was designed to comprise a com- 
prehensive alanine scan, as well as context- 
specific mutations of modifiable residues (e.g., 
lysine and arginine), such as charge removal 
or reversal and substitutions mimicking PTMs 
(22, 23). Partial deletions of the N-terminal 
tails of H3 and H4 were also included, because 
these regions play important and sometimes 
redundant roles in chromatin biology (24, 25). 
In budding yeast, histones H3 and H4 are expressed
from two loci each, HHT1/HHT2 and
HHF1/HHF2, respectively. To ensure preservation
of the native expression levels, we engineered
each strain to include identical point
mutations in both relevant loci with separate
selection markers (HYG^R and URA3) (Fig. 2A).
In total, we designed 479 histone mutants, of 
which 350 were amenable to pE-MAP analysis 
(Fig. 2, B to D, and table S1); the remaining 129 
mutants either were lethal or exhibited very 
poor growth, rendering them inaccessible to 
genetic analysis (fig. S1). The histone mutants 
were crossed against a library of 1370 gene de- 
letions and hypomorphic alleles (table S1) using 


Braberg et al., Science 370, eaaz4910 (2020) 





[Fig. 1 panel labels: point-mutant alleles; wild-type cell; phenotypic similarities; translate MIC values to spatial restraints; data likelihood; distance (Å)]







our triple-mutant selection strategy (26, 27) 
involving three different selectable markers 
(HYG^R and URA3 to select for both copies of
the histone alleles and KAN^R for the knockout
library strains) (Fig. 2A and Methods)
(26). Genetic interactions were quantified using 
the S-score (28), which measures the devia- 
tion of the double mutant fitness from the 
expected combined effect of the individual 
mutations (Methods). The pE-MAP screen 
was carried out in three biological replicates 
(Methods), which exhibit a high reproducibil- 
ity (Fig. 2E), and the final S-scores (as depicted 
in Fig. 1C) are the averages of these replicates. 

It has been shown that a pE-MAP can be 
used to predict protein-protein interactions 
(PPIs) by comparing the genetic interaction 
patterns between pairs of deletion mutants 
across all the point mutants (15, 29). On a global 
level, this is only possible if the point mutant 
set affects a broad group of processes and ex- 
hibits genetic interactions with the many dif- 
ferent deletion mutants that encode the PPI 
proteins. Because the histone mutant collec- 
tion perturbs only two proteins (H3 and H4), 
we set out to investigate whether the resulting
phenotypic profiles are sufficient to predict 
PPIs among the 1370 deletion mutants. Using 
a receiver operating characteristic (ROC) curve, 
we find that the histone pE-MAP predicts PPIs 
genetic interaction scores to obtain the phenotypic profiles. (C) An
example subset of a pE-MAP of point mutants crossed against a library of 
gene deletions. (D) Each pairwise combination of phenotypic profiles is 
transformed into a single MIC value that reflects the similarity between 

the two profiles. The MIC values are translated into spatial restraints
for integrative modeling. (E) The MIC values and other input information
are used for integrative structure modeling. An ensemble of structures that
satisfy the input information is obtained.


similarly to previous E-MAPs that affect more 
genes (15, 29) (Fig. 2F and Methods). This find- 
ing indicates that the combined set of histone 
point mutants affects a broad set of cellular 
processes, reflecting the multifunctional nature 
of histones H3 and H4 and their central role 
in controlling the global genetic environment 
of cells. 

To gain insight into the regulatory hierarchy 
that drives the widespread functional effects 
of histone perturbations, we set out to examine 
the relationship between genetic interactions 
and gene expression changes. To this end, we 
determined the genome-wide gene expression 
levels for 29 representative histone mutants 
using RNA sequencing (RNA-seq) and found 
no correlation between the expression change 
of a gene resulting from a given histone muta- 
tion and the corresponding S-score (Fig. 2G 
and table S2). This indicates that observed 
genetic interactions between histone muta- 
tions and deletion mutants are due to complex 
regulatory patterns, rather than the histone 
mutation directly modulating the expression 
of the interacting gene. 

The pE-MAP was clustered hierarchically 
along both dimensions (Fig. 3A and data S1) 
and effectively recapitulates known protein 
complex and pathway memberships. For exam- 
ple, the pE-MAP identified COMPASS (30, 31), 
Fig. 2. Genetic interrogation of histones H3 and H4 at a residue-level resolution.
(A) Each histone mutant strain was modified at both native loci (HHT1 and
HHT2 for H3 or HHF1 and HHF2 for H4, red stars) and crossed against a library of 
1370 different deletion mutants (or hypomorphic alleles for essential genes). Chr, 
chromosome. (B) Schematic of the histone point mutants analyzed in this study 
(table S1). Secondary structure elements are indicated as ribbons above the amino 
acid sequence. The mutations are color coded according to the mutation introduced 
(C). Mutations resulting in inviable strains or strains too sick for genetic analysis 
are shown in fig. S1. (C) Table of histone mutant categories and their hypothesized 
effects [color coding as in (B)]. (D) Overview of viable H3 and H4 tail deletion 
mutants amenable to pE-MAP analysis. The amino acid sequences of the WT alleles 
are shown on top (residues 1 to 39 of histone H3 and 1 to 27 of histone H4). Gray 
bars represent the deleted residues in H3 and H4. (E) Reproducibility of histone
pE-MAP S-scores between biological replicates. Plotted are all S-score pairs among 
the biological replicas, which include triplicate measurements for 346 histone alleles 
and duplicates of four alleles (H4E73Q, H4H18A, H4I21A, and H4K44Q). (F) ROC
curves showing the power to predict physical interactions between pairs of proteins 
from this pE-MAP (blue) as well as a previously published pE-MAP [green, (15)] 
and E-MAP [black, (29)] data. (G) Relationship between gene expression (log2
fold change over WT) and S-scores of 29 H3 and H4 alleles (table S2). Data from all
1256 deletion library mutants that were measured in both RNA-seq expression 
and pE-MAP analysis are plotted. Single-letter abbreviations for the amino acid residues 
are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys;
L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 
Fig. 3. The genetic interaction landscape of histones H3 and H4.

(A) Hierarchically clustered pE-MAP of 350 histone H3 and H4 alleles 
screened against a library of 1370 deletion mutants or hypomorphic alleles. 
The pE-MAP consists of more than 479,000 genetic interactions. Positive 
(suppressive or epistatic) and negative (synthetic sick) genetic interactions 
are colored in yellow or blue, respectively. Examples of histone alleles with 
similar genetic interaction profiles are highlighted on the right side in the 
context of the nucleosome structure. The nucleosome structure is modified 


Swr1-C (32), and the Set2-Eaf3 pathway (33-35),
as well as clusters of genes linked to telomere 
maintenance and Golgi-ER traffic (Fig. 3B and 
data S1). Furthermore, mutations of histone 
residues in close proximity to each other (e.g., 
mutants of the H3 or H4 N-terminal tails) 
tend to show similar phenotypic profiles (Fig. 
3 and fig. S2A). Overall, we find that histone 
tail deletion mutants give rise to stronger 
phenotypic profiles than the point mutants 
(fig. S2B), reflecting the multiple residue per- 
turbations and the importance of functional 
histone tails for cell homeostasis. 


Phenotypic profile similarities are correlated 
with structural proximity 


Similarities between pairs of phenotypic profiles
in the histone H3-H4 pE-MAP were quantified
by the maximal information coefficient
(MIC) (36, 37) (Fig. 1D, fig. S3, and Methods). 
The MIC values between pairs of phenotypic 
profiles do not linearly correlate with the dis- 
tances between the mutated residues in the 
WT structure (Pearson correlation coefficient 
of −0.07, Cα-Cα distances) but are informative
about an upper distance bound between the 
residues (Fig. 4A and fig. S3C). The upper 
distance bound was obtained by binning the 
MIC values into 20 intervals and selecting 
the maximum distance spanned by any pair 
of residues in each bin, followed by fitting a 
logarithmic decay function to these maximum 
distances (Fig. 4A, fig. S3C, and Methods). 
The data show that a pair of proximal point 
mutations are more likely to have a high MIC 
value than a pair of distal point mutations. How- 
from PDB 1ID3 (data S2), with H3 in purple, H4 in green, and mutated or
deleted residues highlighted in red. N-terminal tail residues of H3 and H4 not 
included in PDB 1ID3 are visualized as strings on the periphery. (B) Examples of 
genetic interaction profiles of gene clusters belonging to known protein 
complexes or biological pathways are highlighted and their genetic interaction 
profiles enlarged from (A). DDR, DNA damage or repair; UPP, ubiquitin 
proteasome pathway; SAGA, Spt-Ada-Gcn5 acetyltransferase; SWI/SNF,
SWItch/sucrose non-fermentable.


ever, not all proximal mutations have a high MIC 
value: Most pairs of phenotypic profiles, even 
those for residues that are less than 16 A apart, 
are highly dissimilar (94% of all pairs exhibit a 
MIC value <0.3). These observations justify 
converting the pE-MAP data into a Bayesian 
data likelihood that provides an upper bound 
on the distance spanned by the mutated resi- 
dues (Fig. 4B and Methods). This Bayesian term 
objectively interprets the noise in the experi- 
mental data and allows us to quantify the 
uncertainty of the resulting structural models. 
The complete scoring function for evaluat- 
ing any structural model also includes simple 
terms accounting for excluded volume and se- 
quence connectivity, in addition to the Bayesian 
terms for all pairs of profiles in the pE-MAP 
with a MIC value above 0.3 (Fig. 5). 
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Fig. 4. Generation of the scoring function. 

(A) Relationship between pairwise distances and MIC 
values. The solid gray line represents the logarithmic 
decay fit to the upper distance bounds (Methods, Eq. 1).
The background color gradient reflects how the data
likelihood depends on MIC value and distance. (B) Negative
log of the data likelihood as a function of distance for
different MIC values (Methods). 
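
The upper-bound construction described in the text (binning the MIC values into 20 intervals, taking the maximum residue-residue distance in each bin, and fitting a logarithmic decay to those maxima) can be sketched as follows. This is only a minimal illustration: the function names, the synthetic data, and the functional form d_max = a + b ln(MIC) are assumptions made here, because the actual fit is defined by Eq. 1 in Methods, which is not reproduced in this section.

```python
import numpy as np


def upper_distance_bound(mic, dist, n_bins=20):
    """Return (bin centers, maximum distance) for each non-empty MIC bin."""
    mic = np.asarray(mic, dtype=float)
    dist = np.asarray(dist, dtype=float)
    edges = np.linspace(mic.min(), mic.max(), n_bins + 1)
    # Using only the interior edges maps every value to an index in 0..n_bins-1.
    idx = np.digitize(mic, edges[1:-1])
    centers, maxima = [], []
    for b in range(n_bins):
        in_bin = idx == b
        if in_bin.any():
            centers.append(0.5 * (edges[b] + edges[b + 1]))
            maxima.append(dist[in_bin].max())
    return np.array(centers), np.array(maxima)


def fit_log_decay(centers, maxima):
    """Least-squares fit of d_max = a + b * ln(MIC) (assumed form, not Eq. 1)."""
    b, a = np.polyfit(np.log(centers), maxima, 1)
    return a, b


# Synthetic, made-up data: distances never exceed a bound that shrinks with MIC.
rng = np.random.default_rng(0)
mic_values = rng.uniform(0.05, 1.0, size=2000)
true_bound = 10.0 - 20.0 * np.log(mic_values)
distances = rng.uniform(0.0, 1.0, size=2000) * true_bound

bin_centers, bin_maxima = upper_distance_bound(mic_values, distances)
a_fit, b_fit = fit_log_decay(bin_centers, bin_maxima)
print(f"d_max ~ {a_fit:.1f} + ({b_fit:.1f}) * ln(MIC)")
```

A negative fitted slope b corresponds to the decay seen in Fig. 4A: the higher the MIC value, the smaller the largest distance observed between the two mutated residues.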


Fig. 5. Description of the integrative modeling workflow. The four stages include (i) gathering all available
experimental data and prior information, (ii) translating all information into a representation of the assembly
components and a scoring function for ranking alternative assembly structures, (iii) sampling structural models, and
(iv) validating the model. In this example, the representation of the components of a complex is based on
comparative models of its components. The scoring function consists of spatial restraints that are obtained from
pE-MAP and/or cross-linking experiments (evolutionary coupling analysis is not indicated in this scheme) as well as
excluded volume and sequence connectivity restraints. The sampling explores the configurations of rigid
components, searching for those assembly structures that satisfy the spatial restraints as well as possible. The
goal is to obtain an ensemble of structures that satisfy the input data within the uncertainty of the data used to
compute them. The sampling precision is estimated and models are clustered and evaluated by the degree to which
they satisfy the input information used to construct them as well as omitted information. The protocol can iterate
through the four stages until the models are judged to be satisfactory, most often based on their precision and the
degree to which they satisfy the data.


Spatial restraints derived from pE-MAP data can
be used for integrative structure determination


An ensemble of the H3-H4 dimer configurations that satisfy the input
information (i.e., the model) was found by exhaustive Monte Carlo sampling
guided by the scoring function, starting with random initial configurations of
the rigid comparative models of the H3 and H4 subunits (Fig. 5 and Methods).
The resulting ensemble is accurate and precise, as demonstrated by the
similarity between the x-ray structure [Protein Data Bank (PDB) 1ID3, (38)] and
model contact maps (Fig. 6, A and B). Specifically, the mean accuracy is 3.8 Å
(Fig. 6C); the accuracy is defined as the average Cα root-mean-square deviation
(RMSD) between the x-ray structure and each of the structures in the ensemble.
The precision is 1.0 Å (Fig. 6C), which is defined as the average RMSD between
all solutions in the ensemble. As a control, we also computed a model from
randomly shuffled MIC values. The resulting model (Fig. 6D) is incorrect (mean
accuracy of 15.8 Å and incorrect contact map; Fig. 6, C and D) and imprecise
(7.6 Å; Fig. 6C). As another control, we computed a model by a state-of-the-art
protein-protein docking method (39), resulting in a model with an inferior
accuracy of 6.9 Å (fig. S4). Finally, we also mapped the accuracy and precision
of the model as a function of the fraction of the pE-MAP data used (Methods).
As expected, the more pE-MAP data that are used, the more accurate and precise
the model is (Fig. 6C).
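
The two ensemble statistics defined above (accuracy as the average RMSD between the reference structure and each model, precision as the average RMSD over all pairs of models) can be illustrated with a short sketch. It assumes, for simplicity, that all structures are given as Cα coordinate arrays with identical atom ordering in a common reference frame, so no superposition step is performed; the function names and the toy coordinates are hypothetical and not taken from the paper.

```python
import numpy as np


def rmsd(a, b):
    """RMSD between two (N, 3) coordinate arrays with matched atom order."""
    diff = np.asarray(a, dtype=float) - np.asarray(b, dtype=float)
    return float(np.sqrt((diff ** 2).sum(axis=1).mean()))


def accuracy(ensemble, reference):
    """Average RMSD between the reference and each model in the ensemble."""
    return float(np.mean([rmsd(model, reference) for model in ensemble]))


def precision(ensemble):
    """Average RMSD over all distinct pairs of models in the ensemble."""
    n = len(ensemble)
    pairs = [rmsd(ensemble[i], ensemble[j])
             for i in range(n) for j in range(i + 1, n)]
    return float(np.mean(pairs))


# Toy example: a four-atom "reference" and two models shifted by +/- 1 along x.
reference = np.zeros((4, 3))
ensemble = [reference + np.array([1.0, 0.0, 0.0]),
            reference + np.array([-1.0, 0.0, 0.0])]

acc = accuracy(ensemble, reference)
prec = precision(ensemble)
print(acc, prec)
```

In this toy case each model lies 1 unit from the reference and the two models lie 2 units apart, so lower values of both statistics indicate a better ensemble, as in the comparison between the full and shuffled datasets.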

To compute the structure of a protein com- 
plex for which the structures of the compo- 
nents are known, we estimate that 35 to 40 
mutations per component are necessary to 
generate a complex model with precision suf- 
ficient to map the positions and relative orien- 
tations of the components (fig. S5 and Methods). 
What constitutes a useful model precision depends on
the questions asked (40). Fortunately, many
questions can often be answered by models as
precise as those obtained based on pE-MAP
data (RMSD range of 1 to 15 Å). Some examples include describing the
architecture and evolution of protein assemblies (8, 41), designing
interface mutations (42), characterizing
structural heterogeneity of protein complexes 
(42, 43), and mapping binding-induced struc- 
tural changes (44). Importantly, the estimate 
of 35 to 40 mutations is an upper bound, and, 
in many cases, the number might be reduced 
by specifically exploiting point mutations that 
target surface residues and/or residues known 
to be functionally important and by choosing 
substitutions likely to give rise to functional 
perturbations. The outcome of these calculations 
indicates the utility of the pE-MAP data for in- 
tegrative structure determination. 


The pE-MAP connects individual histone 
residues and regions to other associated 
complexes and processes 


To examine whether the pE-MAP can identify interactions with complexes that are
not stably associated with histones, we investigated the relationships between
modifiable histone residues and their cognate enzymes (modifier pairs).
Interestingly, we observed a dramatic increase in S-scores within specific
modifier pairs, as compared with the overall genetic interaction distribution
(Fig. 7A and table S4). The positive S-scores reflect that a modifier and its
target residue often are epistatic or suppressive because they function in the
same pathway. To test whether this pattern extends to phenotypic profile
similarities, we integrated the histone pE-MAP into a merged map of previously
collected genetic interaction data for gene deletions and hypomorphic alleles
(45). We computed Pearson correlation coefficients for each histone mutant
phenotypic profile across the merged map, generating a correlation map of 350
histone mutants against 4414 whole-gene perturbations (data S3 and Methods). In
agreement with the individual S-scores, the specific modifier pairs exhibit
significantly higher phenotypic profile correlations than the overall map
(Fig. 7A and table S4). These findings show that the pE-MAP can be used to pair
specific residues to their respective modifiers, even though these are not
stably associated with the histones. For example, when components of COMPASS,
which methylates histone H3K4 (30, 31, 46), are deleted (swd1Δ, swd3Δ, sdc1Δ,
bre2Δ), we observe strong positive S-scores with both H3K4 mutants [K4R
(Lys4→Arg) and K4Q (Lys4→Gln)] as well as high correlations of the phenotypic
profiles between the H3K4 mutants and COMPASS deletions (Fig. 7B and data S1
and S2).

Fig. 6. Integrative structure determination of histones H3 and H4. (A) The
native structure of the histone H3-H4 dimer (PDB 1ID3, left) and its contact map
(right). In contact maps, the intensity of gray is proportional to the relative
frequency of residue-residue contacts in the models (cutoff distance of 12 Å). For
x-ray structures, the contact frequency is either 0 (white) or 1 (black). The
circles correspond to the pairs of restrained residues, with the intensity of red
proportional to the MIC value (MIC > 0.3), showing that the pairs of residues with
high MIC values are distributed throughout the proteins. (B) The localization
probability density of the ensemble of structures is shown with a representative
(centroid) structure from the computed ensemble embedded within it (left)

and the corresponding contact map (right). The localization probability density
map represents the probability of any volume element being occupied by a given
protein. (C) Distributions of accuracy (left) of structures in the ensembles
and model precisions (right) based on the full pE-MAP dataset, resampled
datasets that consider fractions of the data, and using shuffled MIC values.
On the left, the white dots represent median accuracies, and on the right,
the error bars represent the standard deviations of model precision over
three independent realizations (shown as dots). (D) Localization probability
density and centroid structure (left) and contact map (right) computed with
shuffled MIC values (Methods).

To explore these relationships in a structural context, we developed a
Cytoscape (47) app named stE-MAP (structure E-MAP) that interactively maps the
genetic interactions of pE-MAP gene clusters onto the point-mutated protein
structure. stE-MAP connects Cytoscape to ChimeraX (48) and displays connections
between a predefined set of genes and all mutated residues for which the
underlying interactions pass user-defined criteria (fig. S6A). We mapped the
genetic connections between COMPASS and all histone residues with which it
exhibits >0.2 median correlation (Methods). Only five residues pass this
threshold, and the strongest connection is displayed by H3K4. The other four
residues (H3K1 to H3K3 and H3K5) are proximal and are thus likely to interfere
with the interaction between COMPASS and H3K4 (Fig. 7C). This finding is
particularly notable because these residues reside in the
most distal region of the unstructured H3 
N-terminal tail. Given that we do not have 
COMPASS point mutations in our dataset, we 
did not attempt to model this interaction. How- 
ever, analysis of the MIC values associated with 
the H3 and H4 tails, and their relationship 
with the core domains, indicates that distance 
restraints for the histone tails could be derived 
from the pE-MAP data. Specifically, the MIC 
value distributions for the tail-core and tail-tail 
pairs of mutations are similar to that of the 
core-core mutations (fig. S5C). This similarity 
indicates that we can derive distance restraints 
for the histone tails, thus, in principle, sup- 
porting the feasibility of integrative structure 
modeling of disordered regions. In such model- 
ing, to avoid overinterpretation of the data and 
to account for a possibility that the pE-MAP 
data of the histone tails reflect interactions 
with neighboring nucleosomes, we would have 
to include multiple nucleosome copies in the 
model and allow for assignment ambiguity, 
just as we do for distances inferred from chem- 
ical cross-links and protein proximity inferred 
from affinity copurification (49).

Fig. 7. Connecting individual histone residues and regions to other
associated complexes. (A) Comparison of S-scores and Pearson correlation
coefficients of phenotypic profiles of modifier-residue pairs to the overall data.
Only residues with a single known modifier and modifiers with a single known
target residue were included (table S4). p values were calculated using two-sided
Wilcoxon rank sum tests. The whiskers of the boxplots extend to a maximum
of 1.5 × IQR (interquartile range), and outliers are not plotted. (B) Average
distributions of S-scores (left) and phenotypic profile correlations (right) of H3K4
mutants (mean of H3K4Q and H3K4R). Members of the COMPASS complex
that exhibit a mean S-score >2.5 or a mean genetic interaction profile correlation
>0.2 with H3K4 mutants are highlighted. The COMPASS complex is responsible
for H3K4 methylation. (C) Mapping of genetic interaction profile correlations
to COMPASS complex members on the structure of the nucleosome (modified PDB
1ID3, data S2). N-terminal tail residues of H3 and H4 not included in 1ID3 are
visualized as strings on the periphery. Only residues that exhibit a median genetic
profile correlation >0.2 with the COMPASS subunits are highlighted (Methods).
H3 is depicted in purple, H4 in light green, and H2A/H2B and DNA in gray.
The red color gradient reflects the strength of the correlation between each
residue and the COMPASS members, calculated as the median correlation between
the residue’s tested mutations and the COMPASS members. (D) Distributions
of genetic interaction profile correlations of H3K56Q (acetylation mimic) and
H3K56R (deacetylation mimic). Correlations of key H3K56ac-level regulators,
Rtt109 (acetylating) and Hst3 (deacetylating), are highlighted. The cartoon
outlines the H3K56 acetylation pathway and its role in H3 ubiquitylation. Rtt109
acetylates H3K56 through an Asf1-dependent mechanism, which promotes
ubiquitylation of H3 by Rtt101-Mms1 and Mms22. These five gene deletions
are all found among the top 10 most similar to the deacetylation mimic H3K56R,
whereas deletion of the H3K56 deacetylase Hst3 instead gives rise to a
profile similar to the acetylation mimic H3K56Q (table inset). CC, Pearson
correlation coefficient.
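
The residue-level criterion used for Fig. 7C (the median correlation between a residue's tested mutations and the members of a complex, compared against a threshold of 0.2) can be sketched as below. The data layout, the function name, and all numerical values are hypothetical and chosen only for illustration; the stE-MAP app itself is not reproduced here.

```python
from collections import defaultdict
from statistics import median


def residues_linked_to_complex(corr, residue_of, members, threshold=0.2):
    """Return {residue: median correlation} for residues above the threshold.

    corr maps (mutant, gene) pairs to profile correlations, residue_of maps
    each mutant to its residue, and members is the set of complex subunits.
    """
    per_residue = defaultdict(list)
    for (mutant, gene), value in corr.items():
        if gene in members:
            per_residue[residue_of[mutant]].append(value)
    medians = {res: median(vals) for res, vals in per_residue.items()}
    return {res: m for res, m in medians.items() if m > threshold}


# Made-up correlations, for illustration only.
corr = {
    ("H3K4R", "swd1"): 0.5, ("H3K4R", "swd3"): 0.4,
    ("H3K4Q", "swd1"): 0.3, ("H3K4Q", "swd3"): 0.6,
    ("H3K9A", "swd1"): 0.1, ("H3K9A", "swd3"): -0.05,
    ("H3K4R", "set2"): -0.1,  # not a member of the complex, so ignored
}
residue_of = {"H3K4R": "H3K4", "H3K4Q": "H3K4", "H3K9A": "H3K9"}
linked = residues_linked_to_complex(corr, residue_of, {"swd1", "swd3"})
print(linked)
```

Pooling all mutations of a residue before taking the median mirrors the grouping described in the caption; mutation-specific effects can be examined by skipping the pooling step.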


We observe similar trends for other histone 
modifiers. For example, members of the Set2 
pathway (Set2, Eaf3, Rco1, Ctk1) (33-35) rank highly in the distributions of
S-scores or correlations for mutations of their target residue, H3K36 (fig. S6,
B and C). Interestingly, we also found instances where different mutations of a
single residue identify connections to different modifiers. For example, the
phenotypic profile of the deacetylation mimic H3K56R is similar to that of
deletion of RTT109, which encodes the H3K56 acetylase (Pearson correlation
coefficient of 0.4) (29), whereas the acetylation mimic H3K56Q instead
correlates with the profile generated from the deletion of the corresponding
deacetylase, HST3 (Pearson correlation coefficient of 0.35) (50) (Fig. 7D).
H3K56R further correlates with asf1Δ, rtt101Δ, mms1Δ, and mms22Δ, whose
corresponding proteins play key roles in the H3K56 acetylation pathway and
downstream H3 ubiquitylation (51, 52) (Fig. 7D). Accordingly, the stE-MAP app
identified strong links between Hst3-Hst4 and H3K56Q, as well as Rtt109-Asf1
and H3K56R (fig. S6D). Although we find that it is often
informative to group different mutations of the 
same residue together, these examples high- 
light the potential of these maps for deeper 
mechanistic insights where required. 
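
The profile comparisons used throughout this section (Pearson correlation of a histone mutant profile against whole-gene perturbation profiles, followed by ranking the most similar perturbations) can be sketched as follows. The sketch assumes complete profiles measured over the same ordered set of library strains, with no missing values and no constant profiles; the gene names and numbers are placeholders, not data from the study.

```python
import numpy as np


def correlation_map(mutant_profiles, gene_profiles):
    """Pearson correlation of every mutant profile with every gene profile.

    Each row is one profile; columns are the shared library strains.
    """
    a = np.asarray(mutant_profiles, dtype=float)
    b = np.asarray(gene_profiles, dtype=float)
    a = (a - a.mean(axis=1, keepdims=True)) / a.std(axis=1, keepdims=True)
    b = (b - b.mean(axis=1, keepdims=True)) / b.std(axis=1, keepdims=True)
    return a @ b.T / a.shape[1]


def top_matches(corr_row, gene_names, k=10):
    """Return the k gene perturbations most correlated with one mutant."""
    order = np.argsort(corr_row)[::-1][:k]
    return [(gene_names[i], float(corr_row[i])) for i in order]


# Placeholder profiles: one mutant against three gene perturbations.
mutant_profiles = [[1.0, 2.0, 3.0, 4.0]]
gene_profiles = [[2.0, 4.0, 6.0, 8.0],
                 [4.0, 3.0, 2.0, 1.0],
                 [1.0, 3.0, 2.0, 4.0]]
gene_names = ["geneA", "geneB", "geneC"]

cmap = correlation_map(mutant_profiles, gene_profiles)
best = top_matches(cmap[0], gene_names, k=2)
print(best)
```

Applied to the full dataset, a function like `correlation_map` would yield the 350 by 4414 matrix described earlier, and a ranking like `top_matches` corresponds to the "top 10 most similar" lists shown in Fig. 7D.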

Expanding on these findings, we built a 
gene set enrichment map connecting the 
modifiable histone residues to nuclear pro- 
cesses (fig. S7A, table S5, and Methods). We 
observe both known and previously unknown 
connections. For example, “DNA recombina- 
tion and repair” is connected to four residues, 
and two of these, H3K56 and H3K79, have
been shown to play key roles in yeast’s DNA 
repair (53-57). Interestingly, we find that muta- 
tions of the other two residues (H3R63K and 
H4R36K) result in increased spontaneous muta- 
tion frequencies at the URA3 locus, indicating 
that these residues also function in DNA repair 
(fig. S7, B and C; table S6; and Methods). 

The gene set enrichment analysis also 
identified 13 residues connected to cryptic transcription (fig. S7A). The
pE-MAP includes 24 different mutations of these residues, and we tested their
involvement in cryptic transcription by quantifying the abundance of
transcripts at the 5′ and 3′ ends of the STE11 gene, using quantitative
polymerase chain reaction (qPCR) (fig. S7D and Methods). In total, 16
mutations, distributed among 10 residues, increase 3′ transcript abundance by
>50% compared with WT (table S7), and nine mutants among five residues increase
3′ transcription more than twofold, without major changes in 5′ transcription
(fig. S7E). As expected, H3K36A, H3K36R, H3K36Q, and set2Δ increase 3′
transcript abundance strongly, as do mutations of H4K44, which is a residue
known to affect cryptic transcription (58). Interestingly, H3K122A increases
3′ transcript abundance >15-fold and, using ATAC-seq, we find that the mutation
gives rise to nucleosome-free regions in STE11 and other genes known to produce
cryptic transcripts (fig. S7, F to H). H3K122A exhibits positive genetic
interactions with deletion of the histone chaperone SPT2 (S-score = 2.4) and
the nucleosome remodeling factor CHD1 (S-score = 4.9), which are both involved
in cryptic transcription (59, 60). Accordingly, we find that deletion of either
SPT2 or CHD1 suppresses the cryptic transcription phenotype observed in H3K122A
to WT levels, even though spt2Δ or chd1Δ alone has no effect (fig. S7, I to K).


The integrative structure determination 
approach is transferable to other complexes 


To test whether genetic interaction mapping 
can be used to determine the structure of 
other complexes, we examined a pE-MAP of 
RNAPII in budding yeast (15). This pE-MAP 
consists of 53 point mutants crossed against 
a library of 1200 gene deletions and hypomor- 
phic alleles. Interestingly, the association be- 
tween MIC values and the upper distance 
bound is also apparent in this dataset (fig. S8, 
A and B), even though the protein sizes and 
mutational coverage of the polymerase system 
(up to ~1700 residues and 1 to 2%, respec- 
tively) are vastly different from those of the 
histones (<140 residues and 85 to 90%). These 
observations suggest that our parameteriza- 
tion of the pE-MAP spatial restraint based on 
the histone data may be generally applicable. 
To evaluate this expectation directly, we next 
modeled subunits Rpb1 and Rpb2 of RNAPII 
using the Bayesian likelihood parametrization 
based on the histone pE-MAP. To illustrate 
the modeling of higher-order complexes, we 
divided Rpb1 into two domains, thereby re- 
presenting the system with three rigid bodies 
(Methods and table S8). We obtained a model 
with a mean accuracy of 16.8 Å and precision
of 9.8 Å (Fig. 8, A to D, and table S8). This
positive result illustrates the generality of the 
pE-MAP-based spatial restraints. 

To further assess the utility of pE-MAP data 
for structure determination, we compared the 
RNAPII model obtained using the pE-MAP to 
a model using 22 previously published chemical
cross-links (61). Cross-linking is widely
used for integrative structure determination 
of macromolecular assemblies (2, 8). Inter- 
estingly, a model of yeast RNAPII based on 
the pE-MAP data is as accurate as that based
on the cross-links (16.8 and 16.7 Å, respectively;
Fig. 8D). Moreover, the accuracy and
precision of the model improve if both datasets
are used simultaneously (10.2 and 3.7 Å,
respectively; Fig. 8D), indicating complemen- 
tarity between the two types of data and de- 
monstrating a premise of integrative structure 
determination (Fig. 5). Although a cross-link 
between two residues may provide more direct 
structural information than the corresponding 
pE-MAP pair, the number of possible cross- 
links is limited by the number of proximal 
reactive residue pairs, whereas the number of 
pE-MAP pairs grows quadratically with every 
additional point mutation introduced. Therefore,
the larger number of less precise pE-MAP
restraints can lead to a more accurate model than
a smaller number of more precise cross-links.

Braberg et al., Science 370, eaaz4910 (2020)
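
The quadratic growth in the number of pE-MAP pairs noted above follows from counting unordered pairs of point mutants, n(n − 1)/2. A small worked example is given below; the values 53 (RNAPII point mutants) and 308 (histone single-point mutants used to design the restraint) come from the text, whereas 10 is an arbitrary value for comparison and the helper name is hypothetical. Only a fraction of these pairs carry a MIC value high enough to be converted into a restraint.

```python
from math import comb


def n_profile_pairs(n_mutants):
    """Number of unordered pairs of phenotypic profiles, n * (n - 1) / 2."""
    return comb(n_mutants, 2)


for n in (10, 53, 308):
    print(n, n_profile_pairs(n))  # 45, 1378, and 47278 pairs, respectively
```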


The integrative structure determination 
approach is transferable to other types of 
phenotypic profiles 


To examine the applicability of our approach 
to other types of phenotypic profiles, we turned 
to a CG-MAP of 44 bacterial RNAP point muta- 
tions exposed to 83 different environmental 
stresses (e.g., chemical perturbations, temper- 
ature stress, and pH change) (20). We observe 
an association between MIC values and the 
upper distance bound, similar to that of the 
pE-MAP datasets (fig. S8 and Methods). We 
modeled the structure of subunits RpoB and 
RpoC of the bacterial RNAP with a mean accuracy
of 15.0 Å and precision of 6.6 Å (Fig. 8,
E to H, and table S9). This result suggests that 
maps with relatively small numbers of orthogo- 
nal phenotypes per point mutation can be used 
to accurately predict the architecture of macro- 
molecular assemblies. Considering that construct- 
ing large gene deletion libraries and crossing 
them against point mutations can be laborious, 
environmental phenotypic profiles may be a more 
efficient alternative for generating spatial re- 
straints for integrative structure determination 
than genetic interaction phenotypic profiles. 


Spatial restraints derived from pE-MAP 
data are comparable to other commonly 
used data types 


Coevolution information can also be used to 
predict the structure of protein assemblies 
(18, 62, 63). However, the success of such 
modeling is heavily dependent on the number 
of sequences in the input sequence alignments 
and the ability to discriminate interacting from 
noninteracting homologs in genomes with mul- 
tiple paralogs (64). Using the RaptorX protein 
complex contact prediction server (65, 66), 
we predicted the interfacial contacts between 
RpoB and RpoC of the bacterial RNAP; the 
numbers of homologous sequences were in- 
sufficient for the yeast histones and RNAPII 
(Methods). Importantly, RaptorX is based on a 
combination of coevolution analysis and a 
deep-learning algorithm that reduces the re- 
quirement for sequence homologs and improves 
accuracy (67). Other commonly used coevolu- 
tion methods (18, 62) did not identify any 
interfacial contacts. Similar to the pE-MAP and CG-MAP datasets, we observe a
negative statistical association between the residue pair coupling strengths
and the upper distance bounds (fig. S9). To mimic the pE-MAP restraint, we
converted the top coupling strengths into upper distance bound restraints
(Methods). The model ensemble computed from coevolution-derived restraints
includes two different sets of configurations (mean accuracy of 22.6 Å; model
precision of 9.0 Å; Fig. 8H). Only a fraction of the bacterial RNAP structures
computed using coevolution-derived restraints are as accurate as those computed
using the CG-MAP restraints. The precision and accuracy of the model improve
slightly if both types of restraints are combined (mean accuracy of 14.5 Å;
model precision of 6.5 Å; Fig. 8H).


Discussion 


We show that the architectures of macro- 
molecular assemblies can be determined using 
quantitative genetic interaction data collected 
in vivo. The accuracy and precision of such 
models are comparable to those of models 
based on chemical cross-linking or coevolu- 
tion analysis. A key premise of integrative 
modeling is that using several different types 
of data improves the accuracy and precision of 
the model. Because the pE-MAPs and CG-MAPs 
contain purely phenotypic measurements, 
collected in living cells, these datasets gener- 
ate spatial restraints that are orthogonal to 
other commonly used data for integrative 
modeling. Because these data reflect in vivo 
structures, and are thus unlikely to share 
artifacts of biophysical methods, they could 
be of particularly high value in the integrative 
modeling process. The genetic interaction 
data may also allow for the characterization of 
complexes that are difficult to isolate and 
purify or those that are only transiently stable. 
Importantly, the equipment required for 
generating these data is basic, and, in partic- 
ular, the CG-MAPs can be generated efficiently. 
Recent developments in CRISPR-Cas9-based
approaches have paved the way for multiplexed 
precision genome editing in yeast (68), allow- 
ing for rapid generation of CG-MAPs. Together, 
these methods make feasible the proteome- 
wide modeling of protein complex structures, 
guided by global protein-protein interaction 
maps (69). In addition to proteins, the approach 
is also applicable to assemblies containing 
nucleic acids, thus further expanding the scope 
of integrative structural biology. pE-MAPs and 
CG-MAPs are complementary to other high- 
throughput functional assays. For example, two 
recent studies have shown that deep muta- 
tional scans can be used for determining the 
fold of a small protein domain (70, 71). If these 
methods prove useful for multidomain proteins 
or disordered regions (72), then the measured 
phenotype changes could be used to derive 
additional restraints for the integrative struc- 
ture determination. 

The relationship between phenotypic pE-MAP 
measurements and structure can be uncertain. 
The reasons for this include mutations in distant 
positions that are part of an allosteric network 
and could give rise to similar profiles, mutations 
that are functionally irrelevant, and mutations 
that perturb gene expression, mRNA stability, or translation. Additionally, the
approach relies on the introduction of point mutations into the proteins of
interest, which may result in structural changes. However, proteins often adapt
to mutations by small local changes in their structure, maintaining their
overall fold and function (73). Mutations that cause major misfolding of
essential proteins and/or assemblies are uncommon in pE-MAPs because the
resulting fitness defects typically prevent successful screening. The method
could be improved by specifically designing point mutants that do not alter the
structure and/or lead to aggregation, by selecting commonly allowed mutations
as determined by divergent protein sequence alignments (74).

Fig. 8. Integrative structure determination of yeast RNAPII and bacterial RNAP.
(A) The native structure of Rpb1-Rpb2 (PDB 2E2H) showing its three rigid-body
components. Rpb1 was split into two domains, as shown. (B) The localization
probability density of the ensemble of the three rigid-body structures is shown
with a representative (centroid) structure from the com- [...] RpoB-RpoC
structures with a rep- [...] from the computed ensemble embedded within it.
(G) Contact maps computed for the x-ray structure (top) and model using the
CG-MAP dataset (bottom). The shaded yellow band represents a region missing in
the x-ray structure. (H) Distributions of accuracy (top) for all structures in
the ensemble and model precisions (bottom) for the ensembles [...]

The aim of integrative structure determi- 
nation is to model the structures of macro- 
molecular assemblies. This often requires the 
structures of the individual components [from
x-ray crystallography, NMR, cryo-EM, comparative
modeling, or, increasingly, ab initio
structure prediction (65, 67, 75, 76)]. The quality
of the structures of the individual components
and of the input data is crucial for integrative
(or indeed any other) structural approaches,
and one cannot achieve a precise structure
from low-quality starting structures or data.
Even so, there are numerous examples of the
utility of structural models at lower resolution
(3). For example, these models can be used to 
explain the architectural principles of large 
assemblies (8, 41, 77, 78), describe the struc- 
tural dynamics of protein complexes (42, 43), 
or rationalize the impact of many mutations 
(41). A lower-resolution structure is also often 
a useful starting point for higher-resolution
structure characterization.

CRISPR-Cas9 genome editing (79) has proven
highly effective for high-throughput genetic
interaction mapping in mammalian cells
(80, 81). To date, these efforts have relied on 
whole-gene perturbations, but methods for 
systematic generation of point mutants using 
CRISPR-Cas9 have recently been developed 
(82, 83), paving the way for mammalian pE-MAP 
screening. This advance provides a means for 
integrative structure determination of assem- 
blies in human cells and also allows for id- 
entification and characterization of functionally 
relevant structural changes that take place in 
disease alleles. Expanding this analysis to host- 
pathogen complexes (84-86) will be feasible by 
introducing specific mutations into the patho- 
genic genome and studying the phenotypic 
consequences using genetic interaction profil- 
ing of relevant host genes (87). Furthermore, 
several efforts are under way to generate multi- 
scale models of entire cells (88-92). In such in- 
stances, high-throughput genetic interaction 
mapping could provide global insights into 
cellular organization and dynamics of differ- 
ent components while also informing on the 
structures of individual assemblies. 


Methods 
Histone mutant strain construction 


The histone H3 and H4 mutant strain library was constructed essentially as
described (22, 23). Briefly, the mutants (tail deletions, complete
alanine-scan, and context-specific point mutations) were generated in the
YMS196 background (MATα his3Δ leu2Δ ura3Δ can1::STE2pr-spHIS5
lyp1::STE3pr-LEU2) (table S1). First, the base strains were created by
replacing the HHT2-HHF2 locus with a URA3-containing cassette carrying a
mutated HHT2-HHF2 locus with their endogenous promoters. We randomly picked a
few base strains, and the mutated HHT2-HHF2 loci were PCR amplified and
validated by sequencing. Then, the HHT1-HHF1 locus was replaced with a
HYG^R-containing cassette carrying a mutated version of the HHT1-HHF1 locus,
resulting in pE-MAP-amenable strains (MATα his3Δ leu2Δ ura3Δ
hht1-hhf1::HYG^R hht2-hhf2::URA3 can1::STE2pr-spHIS5 lyp1::STE3pr-LEU2).


Histone mutant library validation 


Libraries were validated in three steps: (i) Each 
mutant was constructed, transformed into bac- 
teria, and sequenced; 100% sequence identity 
was required to pass quality control. (ii) After 
the correct integration of histone mutants in 
the HHT2-HHF2 locus, 5 to 10 yeast base strains 
from each 96-well plate were randomly selected, 
and corresponding histone fragments were 
amplified and sequenced to ensure the iden- 
tity of each mutant in the well and no cross- 
contamination during plasmid preparation and 
yeast transformation; 100% of these were correct.
(iii) After obtaining the yeast library with
the second (HYG^R) cassette integrated, 5 to
10 yeast strains from each 96-well plate were 
also randomly selected. Both copies of histone 
mutants in each strain were amplified and se- 
quenced to confirm the identity of mutations. 
All of them were correct. 


pE-MAP analysis 


Each of the histone H3 and H4 mutant strains 
was crossed with 1370 MATa KAN^R-marked
deletion (nonessential genes) or DAmP (de-
creased abundance by mRNA perturbation;
essential genes) strains by pinning on solid
media as described (15). Sporulation was in-
duced, and MATa haploid spores were selected
by replica plating onto media containing cana-
vanine (selecting can1Δ haploids) and S-AEC
(selecting lyp1Δ haploids) and lacking histidine
(selecting MATa spores). Triple-mutant haploids
were isolated on media containing hygromycin
(selecting hht1-hhf1 mutant cassette) and G418
(selecting KAN^R-marked deletion or DAmP),
and lacking uracil (selecting hht2-hhf2 mutant
cassette). Finally, triple-mutant colony sizes were 
extracted using imaging software. The screen 
was carried out in three biological replicates 
with three technical replicates in each. Four 
mutants (H4E73Q, H4H18A, H4I21A, and
H4K44Q) failed screening in one biological
replicate, and the results for these are based 
on the two successful replicates. Detailed E-MAP 
experimental procedures are described in 
(26, 29, 93). Genetic interactions were quanti- 
fied using S-scores (28), which are closely 
related to t values. The S-score quantifies the
deviation of the double (or triple) mutants 
from the expected combined fitness effects of 
the individual mutants and incorporates the 
reproducibility between technical replicates. 
The published S-scores represent the average 
S-scores across biological replicates. 


Design of the pE-MAP spatial restraints 


The distance restraint based on pE-MAP data 
was designed using the 308 single-point mu- 
tants from the histone pE-MAP and the structure 
from the PDB 1ID3, as follows: (i) postprocessing 
of the genetic interaction phenotypic profiles, 
(ii) devising a phenotypic similarity metric be-
tween the phenotypic profiles, and (iii) design-
ing spatial restraints for integrative structure
modeling using the phenotypic similarity values 
and the known nucleosome x-ray structure. Next, 
we describe each of these three steps in turn. 
(i) Postprocessing of the genetic interaction 
phenotypic profiles: All missing values in the 
pE-MAP were imputed as the mean of the 
S-scores between the corresponding deletion 
mutant and all histone point mutants. To in- 
crease the signal-to-noise ratio of the pE-MAP, 
gene deletion mutants that mostly exhibited 
weak genetic interactions with the histone 
mutants were filtered out. To this end, the 
gene deletion profiles were ranked in descend-
ing order based on the counts of their S-scores
that fell in either the top 2.5% of positive 
S-scores or the bottom 5% of negative S-scores, 
from the complete point mutant pE-MAP (cut- 
offs calculated after imputation). The more 
stringent cutoff for positive S-scores was 
chosen to reflect the smaller dynamic range 
for positive genetic interactions compared to 
negative genetic interactions. Gene deletions 
with the same count were then ranked in 
descending order by the mean of the absolute 
values of their highest and lowest score (fig. 
S3A). The top fraction of the deletions, deter- 
mined in step (iii) below, were retained for 
computing the histone point mutant pheno- 
typic profile similarities (below, fig. S3). 
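
The postprocessing in step (i) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the pipeline used in this work: the function names, the dictionary layout (deletion name mapped to a list of S-scores, one per point mutant), and the use of None for missing values are hypothetical, and the positive and negative cutoffs are passed in rather than derived from the complete pE-MAP.

```python
from statistics import mean


def impute_missing(profile):
    """Replace missing S-scores (None) with the mean of the observed scores
    of this deletion mutant across all point mutants."""
    observed = [s for s in profile if s is not None]
    fill = mean(observed)
    return [fill if s is None else s for s in profile]


def rank_deletions(pemap, pos_cutoff, neg_cutoff):
    """Rank deletion mutants by their number of strong interactions.

    pemap maps a deletion name to its imputed list of S-scores.
    Ties are broken by the mean of the absolute values of the highest
    and lowest score, as described in the text.
    """
    def key(name):
        scores = pemap[name]
        strong = sum(1 for s in scores if s >= pos_cutoff or s <= neg_cutoff)
        tie_break = (abs(max(scores)) + abs(min(scores))) / 2
        return (strong, tie_break)

    return sorted(pemap, key=key, reverse=True)


def retain_top_fraction(ranked, fraction=0.25):
    """Keep the top fraction of the ranked deletions (at least one)."""
    keep = max(1, round(len(ranked) * fraction))
    return ranked[:keep]
```

In this sketch the retained fraction defaults to 0.25, the value selected in step (iii).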

(ii) Devising a phenotypic similarity metric 
between the phenotypic profiles: We computed 
the similarity between all pairs of histone 
phenotypic profiles using the maximal infor- 
mation coefficient (MIC; fig. S3B), with the 
MIC parameters alpha and c set to 0.6 and 
15, respectively, as suggested (36, 37). Many 
positions in the histones were mutated to sev- 
eral different residue types, giving rise to sev- 
eral phenotypic profiles for each of these 
positions. As a result, more than one MIC value 
would often be computed for a single residue 
pair. In such cases, only the highest MIC value 
was retained. 
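
The reduction from allele pairs to residue pairs described above can be sketched as follows. The allele naming (for example H4K44Q, where dropping the final letter gives the residue H4K44), the function names, and the choice to skip pairs of substitutions at the same position are assumptions of this illustration; the MIC values themselves are taken as precomputed input.

```python
def residue_of(allele):
    """Strip the substituted residue type: 'H4K44Q' -> 'H4K44' (assumed naming)."""
    return allele[:-1]


def collapse_to_residue_pairs(allele_mic):
    """Keep only the highest MIC value for each pair of mutated positions.

    allele_mic maps (allele_a, allele_b) to the MIC between their profiles.
    """
    best = {}
    for (allele_a, allele_b), mic in allele_mic.items():
        pair = tuple(sorted((residue_of(allele_a), residue_of(allele_b))))
        if pair[0] == pair[1]:
            continue  # two substitutions at the same position carry no distance
        if mic > best.get(pair, float("-inf")):
            best[pair] = mic
    return best
```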

(iii) Designing spatial restraints for integra-
tive structure modeling: Using the histone
x-ray structure [PDB 1ID3; (38)], we measured
the Cα-Cα distance between all pairs of resi-
dues for which we computed a MIC pheno-
typic similarity score. The percentage of the 
top scoring phenotypic profiles [ranked by 
the genetic interaction scores; step (i)] re- 
tained for further analysis was determined as 
follows. We compared the statistical associ- 
ation of the distances between two mutated 
residues with their phenotypic similarity by 
selecting the top 10, 25, 50, and 100% of the 
ranked deletions (fig. S3C). Although MIC 
values between phenotypic profiles do not 
linearly correlate with the distances spanned 
by the mutated residues in the WT structure 
(Pearson correlation coefficient of -0.07 when 
using the top 25% or top 50% of deletions), the 
MIC values provide an upper distance bound 
between the residues. The upper distance bound 
was obtained by binning the MIC values into 
20 intervals and selecting the maximum dis- 
tance spanned by any pair of residues in each 
bin, followed by fitting a logarithmic decay 
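function (d_U) to the upper distance bounds:


$$
d_U(\mathrm{MIC}) =
\begin{cases}
\dfrac{\log(\mathrm{MIC}) - n}{k} & \text{if } \mathrm{MIC} < 0.6 \\
6.84 & \text{if } \mathrm{MIC} \geq 0.6
\end{cases}
\qquad (1)
$$


where k and n are −0.0147 and −0.41, respec-
tively (fig. S3C). We find that selecting the top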
25 or 50% of the deletions had a comparable 
association between the upper distance bounds
and the computed MIC values. The association 
was determined by computing the R and p 
values of the Pearson correlation coefficient 
and association significance, respectively, for 
the log-transformed MIC values. In this work, 
we retained the top 25% of the ranked pheno- 
typic profiles for computing the phenotypic 
profile similarities. 
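
As a worked illustration of the upper distance bound, the sketch below bins (MIC, distance) pairs and evaluates the fitted decay function with the k and n values quoted above. It rests on assumptions that are not stated in the text: equal-width bins on the interval from 0 to 1, a natural logarithm (chosen because it makes the fitted branch meet the 6.84-Å plateau at MIC = 0.6), and hypothetical function and constant names.

```python
import math

K = -0.0147        # k from Eq. 1
N = -0.41          # n from Eq. 1
MIC_PLATEAU = 0.6  # MIC value at and above which the bound is constant
D_PLATEAU = 6.84   # bound (in angstroms) assigned to MIC >= 0.6


def binned_upper_bounds(pairs, n_bins=20):
    """Maximum Ca-Ca distance observed in each MIC bin.

    pairs is an iterable of (mic, distance) tuples; the result maps the
    bin index to the largest distance spanned by any pair in that bin.
    """
    bounds = {}
    for mic, distance in pairs:
        index = min(int(mic * n_bins), n_bins - 1)
        bounds[index] = max(distance, bounds.get(index, 0.0))
    return bounds


def upper_distance_bound(mic):
    """Fitted logarithmic decay d_U(MIC) of Eq. 1."""
    if mic >= MIC_PLATEAU:
        return D_PLATEAU
    return (math.log(mic) - N) / K


for mic in (0.3, 0.4, 0.55, 0.7):
    print(f"MIC = {mic:.2f} -> d_U = {upper_distance_bound(mic):.1f} A")
```

Under these assumptions, MIC values of 0.4 and 0.55 map to bounds of roughly 34 and 13 Å.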

To effectively handle the uncertain relation- 
ship between the data and model, we use 
Bayesian inference for scoring alternative 
models by formulating spatial restraints as 
Bayesian data likelihoods (94). Formally, the 
posterior probability of model M given data
D and prior information I is p(M|D,I) ∝
p(D|M,I)·p(M|I). The model, M, consists
of a structure X and unknown parameters Y,
such as noise in the data. The prior p(M|I) is
the probability density of model M given I.
The prior can in general reflect information
such as statistical potentials or a molecular
mechanics force field; here, we only used
excluded volume and sequence connectivity.
The likelihood function p(D|M,I) is the prob-
ability density of observing data D given
M and I. The pE-MAP data were used to com-
pute phenotypic similarities (i.e., MIC values)
that inform distances between mutated
residue pairs i,j. The likelihood of the entire
pE-MAP dataset is the product over the in-
dividual observations between residue pairs
i,j: p(D|M,I) = ∏_{i,j} N[d_ij | f_ij(X), σ_ij], where
f_ij(X) is a forward model that predicts the data
point d_ij in D that would have been observed
for structure X in an experiment without noise;
N[d_ij | f_ij(X), σ_ij] is a noise model that quan-
tifies the deviation between the predicted and
observed data points.

We defined the forward model by inverting 
the relation between the upper distance bound 
and observed MIC values [d_U(MIC), Eq. 1]:


$$
f_{ij}(X) = \mathrm{MIC}(d_{ij}) =
\begin{cases}
e^{k d_{ij} + n} & \text{if } d_{ij} > d_0 \\
0.6 & \text{if } d_{ij} \leq d_0
\end{cases}
\qquad (2)
$$


where d_0 = d_U(0.6). Our choice of a noise
model is a lognormal distribution with a flat
plateau for MIC values below the upper bound
on the experimentally observed MIC values
(MIC^obs):


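$$
p(\mathrm{MIC}^{\mathrm{obs}}_{ij} \mid \mathrm{MIC}_{ij}, X, \sigma_{ij}) =
\begin{cases}
\dfrac{1}{N} & \text{if } \mathrm{MIC}^{\mathrm{obs}}_{ij} \leq \mathrm{MIC}_{ij} \\
\dfrac{1}{M \sqrt{2\pi}\, \sigma_{ij}\, \mathrm{MIC}^{\mathrm{obs}}_{ij}}
\exp\!\left[ -\dfrac{1}{2\sigma_{ij}^{2}} \log^{2}\!\left( \dfrac{\mathrm{MIC}^{\mathrm{obs}}_{ij}}{\mathrm{MIC}_{ij}} \right) \right]
& \text{if } \mathrm{MIC}^{\mathrm{obs}}_{ij} > \mathrm{MIC}_{ij}
\end{cases}
\qquad (3)
$$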





Here, σ_ij are the noise parameters that can
optionally be determined as part of the model, 
and N and M are normalization factors neces- 
sary to make the likelihood continuous. Log- 
normal noise models have previously been 
used to describe errors of inherently positive 
quantities (95). For computational efficiency,
we used a single σ value for all residue pairs.
An uninformative Jeffreys prior is applied to
σ to represent a lack of information on the
bounds and distribution of this parameter (96). 
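
The forward model and noise model can be illustrated with the short sketch below. It is not the IMP implementation. It assumes that the plateau applies whenever the observed MIC does not exceed the MIC predicted from the model distance (that is, whenever the modeled distance is within the upper bound), sets M to 1, and fixes the plateau height so that the density is continuous where the two branches meet. All function names are hypothetical.

```python
import math

K, N_FIT = -0.0147, -0.41   # k and n quoted for Eq. 1
MIC_PLATEAU = 0.6
D0 = 6.84                   # d_U(0.6), in angstroms


def forward_mic(distance):
    """Noise-free MIC predicted for a Ca-Ca distance (inverse of Eq. 1).

    Because k and n are rounded, the two branches meet only approximately.
    """
    if distance <= D0:
        return MIC_PLATEAU
    return math.exp(K * distance + N_FIT)


def log_likelihood(mic_obs, mic_pred, sigma):
    """Log of a lognormal density in mic_obs with a flat plateau.

    Assumption: flat when mic_obs <= mic_pred, lognormal decay otherwise.
    """
    log_norm = -math.log(math.sqrt(2.0 * math.pi) * sigma * mic_obs)
    if mic_obs <= mic_pred:
        return log_norm
    return log_norm - math.log(mic_obs / mic_pred) ** 2 / (2.0 * sigma ** 2)


def score(observations, sigma):
    """Negative log-likelihood summed over (mic_obs, distance) pairs."""
    return -sum(log_likelihood(mic, forward_mic(dist), sigma)
                for mic, dist in observations)
```

With this convention, a pair with an observed MIC of 0.4 is not penalized at a modeled distance of 20 Å but is penalized at 60 Å.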

Finally, a Bayesian term in the scoring 
function is defined as the negative logarithm 
of the posterior probability density: S(M) = 
−log p(M|D, I). In the Bayesian view, the out-
put model is in fact best equated to the poste-
rior model density that specifies a distribution 
of alternative single models M with varying 
probability density, not a single model, although 
single representative or average models can 
always be proposed based on the posterior 
model density. 


Calculation of similarity metrics for yeast 
RNAPII and bacterial RNAP datasets 


Steps (i) and (ii) from “Design of the pE-MAP 
spatial restraints” were repeated for the yeast 
RNAPII and bacterial RNAP datasets to gene- 
rate the similarity metrics (MIC values) for these 
two systems, with the following modifications: 

For yeast RNAPII, before imputing missing 
values in the pE-MAP, any deletion mutants 
that exhibit missing values with more than 
15% of the point mutants were filtered out. 
This step is part of our pipeline but had no 
effect on the histone pE-MAP (because this 
pE-MAP does not contain any deletion mutant 
with more than 15% values missing). The num- 
ber of ranked deletion mutants retained at the 
end of pE-MAP postprocessing was chosen to be 
25% of the number of deletions in the original 
unfiltered pE-MAP (in accordance with the 
histone pE-MAP processing). 

For bacterial RNAP, owing to the very small 
number of perturbations in this dataset, all the 
perturbations (instead of the top 25%) were 
retained for computing point mutant pheno- 
typic profile similarities. In addition, owing to 
differences in the experimental design for 
generating the yeast pE-MAPs and the bacte- 
rial RNAP CG-MAP, the bacterial RNAP MIC 
distribution had a ~2-fold higher median and 
greater spread than the other datasets. Corre- 
spondingly, the bacterial RNAP MIC dis- 
tribution was normalized using linear scaling, 
decreasing its median to match that of the 
histone MIC distribution. Importantly, this step 
was based solely on the MIC distributions, 
without reliance on any structural information. 
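
A minimal sketch of this normalization is given below, assuming that "linear scaling" means multiplying every MIC value by one constant factor with no offset; the function name is hypothetical.

```python
from statistics import median


def scale_to_reference_median(values, reference_values):
    """Rescale values so that their median matches the reference median.

    Only the two distributions are used; no structural information enters.
    """
    factor = median(reference_values) / median(values)
    return [v * factor for v in values]
```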


Integrative structure determination 


Integrative structure determination for each 
system proceeded through the standard four 
stages (3, 4, 5, 8, 41, 97) (Fig. 5 and tables S3, 
S8, and S9): (i) gathering data, (ii) represent- 
ing subunits and translating data into spatial 
restraints, (iii) configurational sampling to 
produce an ensemble of structures that satisfies 
the restraints, and (iv) analyzing and validating 
the ensemble structures and data. The integra-
tive structure modeling protocol [i.e., stages (ii),
(iii), and (iv)] was scripted using the Python
Modeling Interface (PMI) package, a library
for modeling macromolecular complexes based
on our open-source Integrative Modeling Plat-
form (IMP) package (5), version 2.8
(https://integrativemodeling.org). Files containing
the input data, scripts, and output results are
available at
https://integrativemodeling.org/systems/pemap
and the nascent integrative methods benchmark-
ing section of the worldwide Protein Data Bank
(wwPDB) PDB-Dev repository for integrative
structures and corresponding data
(http://pdb-dev.wwpdb.org) (98).


(i) Gathering data


To mimic realistic integrative structure deter- 
mination, we did not rely on the known atomic 
structures of the subunits in the actual modeled 
complex [correct docking of exact bound struc- 
tures based on geometric complementarity is 
easy, (99)]. Instead, we computed compara- 
tive models of histones H3 and H4 based on 
their alignments with the structures in PDB 1TZY
(100) (89 and 92% sequence identity, respec-
tively), using MODELLER, version 9.21 (101).
The Cα-atom RMSDs between the crystal struc-
tures and comparative models are 2.8 and 5.5 Å
for H3 and H4, respectively, corresponding to
medium- and low-accuracy comparative models. 
The second major input information source was 
a pE-MAP dataset of 308 point mutations in 
histones H3 and H4 crossed against an array 
of ~1370 gene deletion alleles, resulting in 
946 MIC values above 0.3. Of these, 170 MIC 
values were converted into distance restraints 
between H3 and H4 residues (fig. S8 and 
table S3). 

Comparative models of subunits Rpb1 and
Rpb2 of yeast RNAPII were computed based 
on template structures 6GMH (102) (54% se- 
quence identity) and 4AYB (103) (43% sequence 
identity), respectively. The Cα RMSDs between
the crystal structures of subunits Rpb1 and Rpb2
[2E2H (104)] and their comparative models are
7.3 and 5.2 Å, respectively. A pE-MAP dataset of
53 single point mutants in yeast RNAPII (44 of 
which reside in subunits Rpb1 and Rpb2) and a 
library of ~1200 gene-deletions resulted in 
195 MIC values above 0.3. Of these, 123 MIC 
values were converted into distance restraints 
(fig. S8 and table S8). In addition, we com- 
pared the RNAPII model based on the pE-MAP 
to a model based on 22 previously published 
chemical cross-links (67). 

The structures of subunits RpoB and RpoC 
of bacterial RNAP were obtained from the x-ray 
structure of the entire complex (4YG2) (105). A 
CG-MAP of 44 single point mutants of the two 
subunits and a library of 83 conditions (e.g., 
treatments with chemicals and temperature 
shocks) resulted in 109 MIC values above 0.3. 
Of these, 63 MIC values were converted into 
distance restraints between the subunits 
(fig. S8 and table S9). In addition, we com-
pared the bacterial RNAP model based on the
CG-MAP to a model computed based on dis- 
tance restraints derived from the interfacial 
contacts predicted using the RaptorX protein 
complex contact prediction server (65, 66).


(ii) Representing subunits and translating 
data into spatial restraints 


To maximize computational efficiency while 
avoiding using too coarse a representation, we 
represented each complex in a multiscale fashion. 
In particular, the subunits and domains of 
each complex were coarse-grained using beads 
of varying sizes representing either a rigid body 
or a flexible string, based on the available com- 
parative models, as follows (tables S3, S8, and 
S9). The comparative models were coarse-grained 
into two representations at different resolu- 
tions. First, we identified loop regions of at 
least eight residues using DSSP (106, 107) and 
represented them by flexible strings of beads 
of up to 10 residues each. Second, for the re- 
maining residues each bead corresponded to 
an individual residue, centered at the position 
of its Cα atom. With this representation in hand,
we next translated the input information into 
spatial restraints as follows. 

The defining and most important restraint 
for our method is extracted from the pE-MAP 
and CG-MAP data. The collected pE-MAP and 
CG-MAP MIC values were used to construct the 
Bayesian term in the scoring function that re- 
strained the distances spanned by the mutated 
residues as described above. The pE-MAP re-
straint was applied to the one-residue-per-bead
representation for the comparative models as
well as to the flexible beads. To improve com- 
putational efficiency, we only considered point 
mutation pairs with MIC values greater than 
0.3. This restraint was applied to all three com- 
plexes (tables S3, S8, and S9). In addition to 
the pE-MAP data, integrative modeling can 
benefit from many other types of input in- 
formation. Here, we have supplemented the 
pE-MAP and CG-MAP data by additional 
simple terms accounting for excluded vol- 
ume and sequence connectivity. First, the ex- 
cluded volume restraints were applied to each 
bead in the one-residue (or the closest) bead 
representations, using the statistical relation- 
ship between the volume and the number of 
residues that it covered (4, 108). Second, we 
applied the sequence connectivity restraint, 
using a harmonic upper bound on the distance 
between consecutive beads in a subunit, with 
a threshold distance equal to four times the 
sum of the radii of the two connected beads. 
The bead radius was calculated from the ex- 
cluded volume of the corresponding bead, 
assuming standard protein density (4, 108). 
Moreover, we evaluated the utility of the pE-MAP 
and CG-MAP data by considering two addi- 
tional types of restraints. First, the 22 previ-
ously determined BS3 RNAPII cross-links (67)
were used to construct a Bayesian term that
restrained the distances spanned by the cross-
linked residues (30 Å) (109, 110). The cross-link
restraints were applied to the one-residue-per-
bead representation for the comparative mod-
els as well as flexible beads, only for RNAPII
(table S8). Second, we applied the evolutionary 
coupling restraints to determine the structures 
of the RpoB and RpoC subunits of bacterial 
RNA polymerase. Coupling strengths between 
residue pairs were obtained using the RaptorX 
ComplexContact server
(http://raptorx.uchicago.edu/ComplexContact/) (65, 66)
with default parameters. The top L/50 coupling strengths
(fig. S9) with sequence separation of three or 
greater were converted into distance restraints 
using a harmonic upper bound on the distances 
between the residues. The threshold distance 
was set to 12 Å. This restraint was applied only
to a subset of bacterial RNAP modeling in- 
stances (table S9). 
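
The harmonic upper-bound form used for the sequence connectivity and evolutionary coupling restraints above can be sketched as follows. This is a plain-Python illustration rather than a call into IMP; the force constant k and the function names are assumptions, while the thresholds (four times the sum of the bead radii, and 12 Å) are taken from the text.

```python
def harmonic_upper_bound(distance, threshold, k=1.0):
    """Zero at or below the threshold, quadratic penalty above it."""
    excess = distance - threshold
    return 0.5 * k * excess ** 2 if excess > 0 else 0.0


def connectivity_score(distance, radius_a, radius_b, k=1.0):
    """Sequence connectivity term for two consecutive beads.

    The threshold is four times the sum of the two bead radii.
    """
    threshold = 4.0 * (radius_a + radius_b)
    return harmonic_upper_bound(distance, threshold, k)


# Evolutionary coupling restraint with the 12-angstrom threshold
coupling_penalty = harmonic_upper_bound(15.0, 12.0)
```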


Configurational sampling to produce 
an ensemble of structures that satisfy 
the restraints 


The initial positions and orientations of rigid 
bodies and flexible beads were randomized. 
The generation of structural models was per- 
formed using Replica Exchange Gibbs sampl- 
ing, based on the Metropolis Monte Carlo (MC) 
algorithm (110, 111). Each MC step consisted of
a series of random transformations (i.e., rota- 
tion and translation) of the positions of the 
flexible beads and rigid bodies. Details about 
the MC runs for each system are in tables S3, 
S8, and S9. 
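
For orientation, the two acceptance rules that underlie replica exchange Monte Carlo sampling in general are sketched below. These are the textbook Metropolis and replica-swap criteria, written with hypothetical function names; they are not taken from the IMP source, and the temperatures and move sets actually used are those listed in the tables cited above.

```python
import math
import random


def metropolis_accept(delta_score, temperature, rng=random):
    """Accept a proposed move that changes the score by delta_score."""
    if delta_score <= 0:
        return True
    return rng.random() < math.exp(-delta_score / temperature)


def swap_probability(score_i, temp_i, score_j, temp_j):
    """Probability of exchanging the configurations of replicas i and j."""
    delta = (1.0 / temp_i - 1.0 / temp_j) * (score_i - score_j)
    if delta >= 0:
        return 1.0
    return math.exp(delta)
```

A swap is always accepted when the colder replica currently holds the worse-scoring configuration, which is what lets low-temperature replicas escape local minima.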


Analyzing and validating the ensemble 
structures and data 


Model validation follows four major steps 
(3, 112): (i) selection of the models for valida- 
tion, (ii) estimation of sampling precision, (iii) 
estimation of model precision, and (iv) quanti- 
fication of the degree to which a model satisfies 
the information used to compute it. These 
validations are based on the nascent wwPDB 
effort on archival, validation, and dissemina- 
tion of integrative structures (98, 113). We now 
discuss each one of these validations in turn. 

(i) Selection of models for validation: The 
first step is to objectively define the ensemble 
of models that will be further analyzed. For 
each trajectory, we automatically determined 
the MC step at which all data likelihoods and 
priors have equilibrated (run equilibration 
step), and all prior frames are discarded (114).
Discarding the initial, nonequilibrated steps of 
each run is helpful because nontypical early 
configurations (e.g., a random configuration 
of beads, an extended configuration of beads, 
and beads far apart from each other) are re- 
moved from the statistical sample used for 
posterior model estimates. 
With this ensemble of sampled structures
and their corresponding scores in hand, we 
analyze the data likelihoods and priors. We 
used HDBSCAN clustering, a hierarchical 
density-based clustering algorithm, to iden- 
tify all high-density regions in the likelihoods 
and priors (115). If a single cluster was iden- 
tified, we consider all the models after discard- 
ing the initial steps; otherwise, we consider all 
models in the clusters that satisfy the input 
information, within the uncertainty of the 
data, for further analysis (below). 

(ii) Estimation of sampling precision: Next, 
we estimate the precision at which the selected
structures were sampled (sampling
precision) (112); the sampling precision must
be at least as high as the precision of the 
structure ensemble consistent with the input 
data (model precision). As a proxy for testing 
the thoroughness of sampling, we performed 
four sampling convergence tests: (i) verify that 
the scores of refined structures do not continue 
to improve as more structures are computed, 
(ii) confirm that the selected structures in 
independent sets of sampling runs (sample 
A and sample B) satisfy the data equally well, 
(iii) cluster the structural models and deter- 
mine the sampling precision at which the 
structural features can be interpreted (fig. S10), 
and (iv) compare the localization probability 
density maps for each protein obtained from 
independent sets of runs. Details about all the 
tests are described in (112). For each modeling 
instance, the results from the convergence 
tests are summarized in tables S3, S8, and S9. 

(iii) Estimation of model precision: In the 
third step, the model uncertainty (precision) 
is estimated. The most explicit description of 
model uncertainty is provided by the set of all 
models that are sufficiently consistent with 
the input information (i.e., the ensemble). 
Model precision can be quantified by the vari- 
ability among the models in the ensemble; in 
the end, the ensemble can be described by 
one or more representative models and their 
uncertainties. For example, if the structures 
in the ensemble are clustered into a single 
cluster, the model precision is defined as the 
RMSD between models in the cluster. Impor- 
tantly, the uncertainty may not be distributed 
evenly across the ensemble, such that some 
regions are determined at a higher precision 
than others. 
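
One way to compute a single-cluster model precision is sketched below. It assumes that the precision is reported as the average pairwise RMSD, that all models share a common frame so no superposition is needed, and that each model is a list of (x, y, z) bead coordinates in the same order; the function names are hypothetical.

```python
import math
from itertools import combinations


def rmsd(model_a, model_b):
    """Root-mean-square deviation between two equally ordered bead lists."""
    squared = sum((p - q) ** 2
                  for bead_a, bead_b in zip(model_a, model_b)
                  for p, q in zip(bead_a, bead_b))
    return math.sqrt(squared / len(model_a))


def cluster_precision(models):
    """Average pairwise RMSD among the models of one cluster."""
    pairs = list(combinations(models, 2))
    return sum(rmsd(a, b) for a, b in pairs) / len(pairs)
```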

(iv) Quantification of the degree to which a 
model satisfies the data used to compute it: An 
accurate structure needs to satisfy the input 
information used to compute it; all structures 
at computed precision that are consistent with 
the data are provided in the ensemble. A pE- 
MAP derived restraint is satisfied by a cluster 
of structures if the corresponding Cα-Cα dis-
tance in any of the structures in the cluster is
lower than the distance predicted by the MIC
value (Eq. 1). A BS3 cross-link restraint is satisfied
by a cluster of structures if the corresponding
Cα-Cα distance in any of the structures in the
cluster is less than 30 Å (116). The remainder
of the restraints are harmonic, with a specified 
standard deviation. Therefore, a restraint is 
satisfied by a cluster of structures if the res- 
trained distance in any structure in the cluster 
is violated by less than three standard devia- 
tions, specified for the restraint. Tables S3, S8, 
and S9 show that all models satisfy the input 
information within its uncertainty. 
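
The satisfaction criteria above reduce to "any structure in the cluster" tests, sketched here with hypothetical function names. Each function takes the restrained distance measured in every structure of a cluster; treating the harmonic restraints as upper bounds with a violation of max(0, d − threshold) is an assumption of this sketch.

```python
def pemap_restraint_satisfied(distances, mic_upper_bound):
    """True if any structure places the pair closer than the MIC-derived bound."""
    return any(d < mic_upper_bound for d in distances)


def crosslink_satisfied(distances, cutoff=30.0):
    """True if any structure places the cross-linked residues within the cutoff."""
    return any(d < cutoff for d in distances)


def harmonic_upper_bound_satisfied(distances, threshold, sigma):
    """True if any structure violates the bound by less than three sigma."""
    return any(max(0.0, d - threshold) < 3.0 * sigma for d in distances)
```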


Benchmark 


To benchmark the four-stage protocol described 
above, we computed the distribution of the 
accuracy for each structure in the ensemble of 
solutions obtained by integrative modeling. 
The accuracy is defined as the mean of Cα
RMSD between the x-ray structure and each of 
the structures in the ensemble. The PDB ac- 
cession code and accuracies for each modeling 
instance are summarized in tables S3, S8, and S9. 

To assess the information content of the 
histone pE-MAP, we computed the models of 
the H3-H4 complex based on random subsets 
of the data. To this end, from the dataset of 
computed MIC values for pairs of mutated 
residues, we performed three independent 
random selections of 80, 60, 40, and 20% of 
the data each. As expected, the more pE-MAP 
data that are used, the more accurate and 
precise are the models (Fig. 6C). 

Finally, as another test, we computed the 
model based on datasets with randomly shuffled 
MIC values for the same pE-MAP or CG-MAP 
residue pairs, for each of the complexes. 
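
The two control datasets described above (random subsets and shuffled MIC values) can be generated along the following lines. The data layout, a list of (residue_pair, mic) tuples, and the function names are assumptions; a seeded random.Random instance is passed in so that each independent selection is reproducible.

```python
import random


def random_subset(restraints, fraction, rng):
    """Randomly keep the given fraction of the (residue_pair, mic) entries."""
    count = round(len(restraints) * fraction)
    return rng.sample(restraints, count)


def shuffle_mic_values(restraints, rng):
    """Reassign the MIC values at random among the same residue pairs."""
    pairs = [pair for pair, _ in restraints]
    mics = [mic for _, mic in restraints]
    rng.shuffle(mics)
    return list(zip(pairs, mics))
```

For example, three independent 80% subsets would be obtained by calling random_subset with fraction 0.8 and three differently seeded generators.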


Estimation of the number of mutations 
per protein 


To estimate the suggested number of mutated 
positions per protein for integrative structure 
determination, we computed the number of 
mutations that would result in four or more 
MIC values above a 0.4, 0.45, 0.5, or 0.55 
threshold. Based on our scoring function, MIC 
values above these thresholds will result in 
distance restraints with an upper distance 
bound in the 12- to 34-Å range. These dis-
tances are comparable to the upper distance
bounds used for chemical cross-links (e.g., DSS, 
DSSO, EDC). A previous systematic study es- 
tablished that at least four chemical cross-links 
are needed to determine the binding mode of 
protein dimers if the subunit structures are 
known (e.g., from x-ray, NMR, or comparative 
models) (110). In general, adding more chem-
ical cross-links does not further improve the
accuracy, although it increases the precision of 
the resulting ensemble. By analogy, we esti- 
mate that, for systems in which the structures 
of the components are known, a good num- 
ber of mutations per protein is 35 to 40 (fig. 
S5A). This data can be used as a guideline to 
decide on the number of mutations to use for 
generating a pE-MAP or CG-MAP. Important-
ly, this estimate is an upper bound on the
number of mutations, and in many cases, the 
number might be smaller for the following two 
reasons. First, this estimation was done as- 
suming protein-wide mutations of residues, 
often to alanine. In practice, the number of 
necessary mutations can be reduced by specif- 
ically designing point mutations that target 
surface residues and/or residues known to 
be functionally important, and by choosing 
substitutions likely to give rise to functional 
perturbations. In general, we did not find a 
correlation between the secondary structure 
of the residue pairs and their associated MIC 
value (fig. S5B). Second, this estimation only 
relies on the residue pairs with high MIC 
values. In contrast to chemical cross-links, the 
upper distance bounds of pE-MAP derived re-
straints are obtained from the statistical asso-
ciation between the MIC values and distance 
between residues. Consequently, residue pairs 
with low MIC values still carry structural in- 
formation, even if at low resolution. Con- 
sistent with these considerations, the RNAPII 
dataset contains only 31 and 9 mutated resi-
dues for Rpb1 and Rpb2, respectively. Simi-
larly, the bacterial RNAP dataset contains 23
and 15 mutated residues for RpoB and RpoC,
respectively. 


Docking 


To assess the relative value of pE-MAP re- 
straints for structure determination, we com- 
puted the structures of the H3-H4 and RNAPII 
complexes by molecular docking. Specifically, 
we followed an integrative docking protocol 
(117) using the rigid-body docking program 
PatchDock (39). In each case, we used the same 
comparative models and rigid-body definitions 
used for integrative modeling (figs. S4 and S11) 
and default parameter values (figs. S4 and S11). 


Visualizations 


The pE-MAP was hierarchically clustered in 
both histone mutant and gene deletion dimen- 
sions using Cluster 3.0 (118) and displayed 
using Java Treeview (119) (Fig. 3). Images 
highlighting histone residues in context of the 
nucleosome structure (PDB 1ID3 or its modified 
version in data S2) were created using ChimeraX 
(Figs. 3A and 7C; and figs. S1D and S6) (48).


Distance of clustered pE-MAP profiles 


First, all histone alleles affecting residues not 
included in the structural reference (PDB 1ID3,
H3A1-H3K37 and H4S1-H4R17) were removed 
and the remaining data (n = 222) clustered 
hierarchically using Cluster 3.0 (118). For each
node of the clustergram, the mean distance 
among member residues was calculated and 
plotted versus the normalized branch length 
(where the first node is set to branch length =
0 and the last node to branch length = 1) of the
respective node (fig. S2A, red dots, and random
distribution plotted in black). 


Generation of the correlation map 


Pearson correlation coefficients were computed 
for each of the 350 H3 and H4 mutants against 
all genes and alleles (rows) in a merged map of 
previously published genetic interaction data 
[dataset S4 from (45)]. If the overlap between 
a histone mutant and a S-score vector from the 
merged map was <150 scores, the resulting 
correlation was not considered (i.e., replaced 
by “NaN”) (data S3). Pearson correlation co- 
efficients were chosen over MIC for this analysis 
because we found Pearson correlation more 
robust than MIC when many missing values 
were present. 
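
The overlap rule for the correlation map can be sketched as below, with missing scores represented by None (an assumption about the data layout) and a hypothetical function name. Correlations based on fewer than 150 shared scores are returned as NaN, as described above.

```python
import math


def pearson_with_min_overlap(x, y, min_overlap=150):
    """Pearson correlation over positions where both vectors have a score."""
    paired = [(a, b) for a, b in zip(x, y) if a is not None and b is not None]
    n = len(paired)
    if n < min_overlap:
        return float("nan")
    mean_x = sum(a for a, _ in paired) / n
    mean_y = sum(b for _, b in paired) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in paired)
    var_x = sum((a - mean_x) ** 2 for a, _ in paired)
    var_y = sum((b - mean_y) ** 2 for _, b in paired)
    if var_x == 0 or var_y == 0:
        return float("nan")  # correlation is undefined for a constant vector
    return cov / math.sqrt(var_x * var_y)
```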


Structural mapping of genetic 
interactions—stEMAP app 


The hierarchically clustered pE-MAP data was 
imported into Cytoscape (47), creating an ini- 
tial network, and then linked to a modified 
version of the nucleosome structure 1ID3 (data 
S2) using the stEMAP app, developed to facili- 
tate interactive exploration of the pE-MAP. The 
original nucleosome structure was modified 
by adding the N-terminal disordered regions 
of histone H3 and H4 and manually position- 
ing them for clarity. The linking proceeds 
as follows: First, the structure is opened in 
ChimeraX (48) by structureVizX (120) and 
positioned in response to commands from 
the stEMAP app. Then, a residue interaction 
network (RIN) is created by the structureVizX
app where nodes are positioned to reflect the 
nucleosome structure through the help of the 
RINalyzer app (121). Finally, the RIN network
and the network created by the original cluster 
files are merged, and edges are drawn between 
genes and residues with interactions passing a 
user-defined threshold (fig. S6A). All of the 
preceding steps happen automatically through 
the stEMAP app interface, which takes as 
input any given PDB file and a short user- 
defined JSON configuration file defining inter- 
action thresholds (here: correlation > 0.2), colors 
of edges (here: color-gradient from white to red 
for positive correlations), and display style of the 
structure in ChimeraX. 

Selection of individual genes triggers the 
interacting residues to be selected, and, in the 
ChimeraX window, those residues are shown 
as space-filling atoms, which are colored accord- 
ing to the edge colors. When multiple genes 
are selected (e.g., genes belonging to the same 
complex), there might be multiple edges con- 
necting an individual residue. In this case, the 
color reflects the significance and consistency 
of the interactions (see below). To assist in 
interpretation and interactive exploration of 
complex data sets (i) colors are quantized into 
10 bins, five positive and five negative; (ii) a 
heatmap is presented that shows only the 
values for the selected genes and their inter-
acting residues; (iii) sets of genes belonging to
a complex can be selected using the setsApp 
(122); and (iv) a slider provides a filter to re- 
strict the selection to only those mutations 
with a minimum number of interactions. 

To determine if a gene set is connected to a 
given residue, the stEMAP app calculates the 
median Pearson correlation coefficient across 
all genes of the gene set against all different 
mutations at that residue. If this median cor- 
relation is above the threshold of 0.2 (defined 
in the JSON file), the respective residue is colored 
according to the median. To instead determine 
if a gene set is connected to an individual 
mutation, the same method is used, except 
the median correlation coefficient is now cal- 
culated across all genes of the gene set against 
the single given mutation (instead of all muta- 
tions of the residue). 
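
The gene-set rule just described amounts to a thresholded median, sketched here with a hypothetical function name. The input is the list of Pearson correlation coefficients between every gene of the set and either all mutations at a residue or one given mutation; dropping NaN entries before taking the median is an assumption of this sketch, not a documented behavior of the stEMAP app.

```python
import math
from statistics import median


def gene_set_link(correlations, threshold=0.2):
    """Return the median correlation if it exceeds the threshold, else None."""
    values = [c for c in correlations if not math.isnan(c)]
    if not values:
        return None
    mid = median(values)
    return mid if mid > threshold else None
```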


ROC curves 


Only library deletion mutants that exist in 
both this study and the two previously pub- 
lished E-MAP datasets [Braberg et al. (15) 
and Collins et al. (29)] were included (n = 389) 
in this analysis. Based on their pE-MAP profiles, 
Pearson correlation coefficients were calculated 
for all pairwise combinations of these 389 
deletion mutants. To determine the power of 
these correlations to predict physical inter- 
actions between encoded proteins, an ROC 
curve was computed, where a physical inter- 
action between proteins was defined if their 
PE score is greater than 2 (69). From the 
Collins et al. E-MAP, query strain profiles with 
more missing data than the sparsest histone 
mutant were removed, as were query mutants 
that also existed in the library mutant set. Be- 
cause the Braberg et al. pE-MAP only includes 
53 query mutants (rows), we used subsets of 
53 query mutants each for the histone and 
Collins et al. E-MAPs when generating their 
ROC curves, to make all three systems com- 
parable. To this end, for the Collins et al. E-MAP 
and histone pE-MAP, 53 query mutant profiles 
were randomly selected 1000 times, and an ROC
curve was generated for each run. The median 
areas under the ROC curves (AROCs) and cor- 
responding ROC curves are reported together 
with the ROC curve of the pE-MAP from 
Braberg et al. in Fig. 2F. 
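
The subsampling procedure described above can be sketched as follows. This is a minimal illustration under assumptions, not the authors' pipeline: the function names and toy profiles are hypothetical, missing data are not handled, and the area under the ROC curve is computed through its rank-based equivalent (the probability that an interacting pair scores higher than a non-interacting pair, with ties counted as one half).

```python
import random
from itertools import combinations
from statistics import median


def pearson(x, y):
    """Pearson correlation of two equal-length lists (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    pos = [s for s, l in zip(scores, labels) if l]
    neg = [s for s, l in zip(scores, labels) if not l]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative pair")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def median_subsampled_auc(profiles, interacting, n_queries, n_runs, seed=0):
    """Median AUC over random subsets of query mutants.

    profiles: dict mapping library mutant -> list of scores, one per query mutant.
    interacting: set of frozensets naming physically interacting pairs.
    """
    rng = random.Random(seed)
    names = sorted(profiles)
    total = len(profiles[names[0]])
    pairs = list(combinations(names, 2))
    labels = [frozenset(p) in interacting for p in pairs]
    aucs = []
    for _ in range(n_runs):
        idx = rng.sample(range(total), n_queries)  # random subset of query mutants
        scores = [
            pearson([profiles[a][i] for i in idx], [profiles[b][i] for i in idx])
            for a, b in pairs
        ]
        aucs.append(auc(scores, labels))
    return median(aucs)


# Toy example: A and B share a profile shape, C is anti-correlated with both.
profiles = {
    "A": [1, 2, 3, 4, 5, 6],
    "B": [2, 4, 6, 8, 10, 12],
    "C": [6, 5, 4, 3, 2, 1],
}
toy_auc = median_subsampled_auc(profiles, {frozenset(("A", "B"))}, n_queries=3, n_runs=20)
```

With the settings described in the text, `n_queries` would be 53 and `n_runs` would be 1000.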


RNA-seq expression analysis 


Ten ml of overnight cultures of 29 histone 
mutant strains (table S2) were harvested in 
mid-log phase [optical density at 600 nm 
(OD600) ≈ 1.0] and washed with DEPC-ddH2O.
RNA was extracted with hot acidic phenol as 
described previously (123). RNA-seq libraries 
were generated using the QuantSeq 3’ mRNA- 
Seq Library Prep Kit FWD for Illumina 
(Lexogen). Single-end, 50-base reads were se- 
quenced using an Illumina HiSeq 4000 se- 
quencer. Reads were filtered for quality and 
aligned to the yeast genome using TopHat (124).
Nonunique reads and reads mapping to ribo- 
somal RNA were removed before analysis. 
Transcript counts were extracted using htseq-count 
(125), and differential expression was 
measured using the DESeq2 package in R (126).


Identification of functional links between H3 and 
H4 mutants and biological processes 


The correlation map (data S3) was used as the 
basis for this analysis. First, a curated annota- 
tion of all genes in the correlation map relevant 
to nuclear function was devised. Biological 
process definitions for genes in nuclear pro- 
cesses were assigned manually based on litera- 
ture and annotations from previous genetic 
interaction maps (29, 127, 128) (table S5). To
identify links between H3 and H4 residues 
and nuclear processes that were highly cor- 
related, we used a one-sided Mann-Whitney 
U test to compare the correlation distribu- 
tion between the mutants of each H3 and H4
residue and the members of each process to 
(i) the correlations between the same H3 and 
H4 mutants and all genes not in that process 
and to (ii) the correlations between the same 
process and all other H3 and H4 mutants. The 
highest p value of the comparison to (i) or (ii) 
was recorded. False discovery rates (FDRs) 
for the links were then computed using the 
method of Benjamini and Hochberg (129) and 
are reported in table S5. The most significant 
links with FDR <10° were used for follow-ups. 
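
The last two steps of this procedure (keeping the larger of the two p values for each link, then converting p values to FDRs) can be sketched as below. This is an illustration under assumptions: the Mann-Whitney p values are taken as given inputs, and the function names are hypothetical. The adjustment shown is the standard Benjamini-Hochberg step-up procedure.

```python
def link_p_value(p_vs_other_genes, p_vs_other_mutants):
    """Conservative p value for a residue/process link: the higher of comparisons (i) and (ii)."""
    return max(p_vs_other_genes, p_vs_other_mutants)


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted p values (FDRs), returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity of the adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Toy example: four links, each tested against both backgrounds.
raw = [
    link_p_value(0.01, 0.002),
    link_p_value(0.04, 0.01),
    link_p_value(0.03, 0.03),
    link_p_value(0.001, 0.005),
]
fdr = benjamini_hochberg(raw)
```

Taking the maximum of the two p values means a link is only called significant if it stands out against both backgrounds.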


Spontaneous mutation frequency 


Cells were grown to saturation and then plated 
on yeast extract peptone dextrose (YEPD)- and 
5-fluoroorotic acid (5-FOA)-supplemented 
media. Mutants growing on 5-FOA were counted 
only after confirming that colonies growing on 
YEPD for all the strains were of equal size. The 
assay was repeated three times independently 
(table S6). 


MS quantification of H3K56ac levels 
Sample preparation 


Histone mutant cultures (WT, H3R63K, and 
H4R36K) were harvested in mid-log phase 
(OD600 = 1.0) using a 250-mm ceramic filter 
funnel and 30-µm nitrocellulose membranes
connected to high continuous wall suction. 
Yeast were removed from the nitrocellulose 
membrane and flash frozen for storage or 
used immediately for protein extraction. Per 
gram of yeast pellet, 3 ml of Yeast-Protein 
Extract Reagent (Y-PER; ThermoFisher Sci- 
entific) with added protease inhibitors (cOmplete, 
Sigma-Aldrich; one tablet per 50 ml), 
phosphatase inhibitors (PhosSTOP, Sigma- 
Aldrich; one tablet per 50 ml), histone deacety- 
lase inhibitors (sodium butyrate 100 mM and 
nicotinamide 100 mM), and β-mercaptoethanol 
(15 mM) was added. The suspension was mixed 
on a gyrator at 4°C for 30 min and centrifuged.
Pellets were resuspended in fresh Y-PER me- 
dium, and extraction was repeated two ad- 
ditional times for a total of three extractions. 
Pellets were sequentially washed twice with 
3 ml ddH2O per gram of yeast. Histone extrac- 
tion was performed in the presence of 2.5 ml
of 8M urea/0.4N sulfuric acid per gram of 
yeast protein pellets, incubated for 1 hour, 
centrifuged, and supernatants collected. Pro- 
teins were precipitated using a methanol- 
chloroform precipitation as previously described 
(130). Extracted proteins were trypsin digested; 
desalted and acetylated peptides were enriched 
as previously described (131).


Generation of selected reaction monitoring 
(SRM) assays for acetylation sites 


Peptide mixtures (obtained from ThermoFisher) 
were analyzed by liquid chromatography- 
tandem mass spectrometry (LC-MS/MS) on a 
Thermo Scientific Orbitrap Fusion mass spec- 
trometry system equipped with a Proxeon 
Easy nLC 1200 ultra-high-pressure LC and 
autosampler system. Samples were injected 
onto a C18 column (25 cm × 75 µm I.D. packed 
with ReproSil Pur C18 AQ 1.9-µm particles) in
0.1% formic acid and then separated with a 
60-min gradient from 5 to 40% buffer B (90% 
ACN/10% water/0.1% formic acid) at a flow 
rate of 300 nl/min. The mass spectrometer 
collected data in a data-dependent fashion, 
collecting one full scan in the Orbitrap fol- 
lowed by collision-induced dissociation MS/ 
MS scans in the dual linear ion trap for the 20 
most intense peaks from the full scan. Dynamic 
exclusion was enabled for 30 s with a repeat 
count of 1. Charge state screening was used 
to reject analysis of singly charged species or 
species for which a charge could not be as- 
signed. The raw data were matched to protein
sequences using the MaxQuant algorithm (ver- 
sion 1.5.2.8) (132). Data were searched against 
a database containing SwissProt Human protein 
sequences concatenated to a decoy database 
where each protein sequence was randomized 
in order to estimate the FDR. Variable modi- 
fications were allowed for methionine oxida- 
tion and protein N-terminal acetylation and 
lysine acetylation. A fixed modification was 
indicated for cysteine carbamidomethylation. 
Full trypsin specificity was required. The first 
search was performed with a mass accuracy 
of ±20 parts per million and the main search 
was performed with a mass accuracy of ±4.5
parts per million. A maximum of five modifi- 
cations were allowed per peptide. A maximum 
of two missed cleavages were allowed. The 
maximum charge allowed was 7+. Individual 
peptide mass tolerances were allowed. For 
MS/MS matching, a mass tolerance of 0.8 Da 
was allowed, and the top eight peaks per 100 Da 
were analyzed. MS/MS matching was allowed 
for higher charge states and water and ammonia 
loss events. The data were filtered to obtain
a peptide, protein, and site-level FDR of 0.01. 
The minimum peptide length was seven amino 
acids. Selected reaction monitoring (SRM) as- 
says were generated for selected acetylation 
sites. SRM assay generation was performed 
using Skyline (133). For all targeted proteins, 
proteotypic peptides and optimal transitions 
for identification and quantification were selected
based on a spectral library generated from the 
shotgun MS experiments. The Skyline spectral 
library was used to extract optimal coordinates 
for the SRM assays, for example, peptide frag- 
ments and peptide retention times. For each 
peptide, the five best SRM transitions were 
selected based on intensity and peak shape. 


Acquisition and quantification of 
acetylation sites by SRM 


Digested peptide mixtures were analyzed by 
LC-SRM on a Thermo Scientific TSQ Quantiva 
MS system equipped with a Proxeon Easy nLC 
1200 ultra-high-pressure LC and autosampler 
system. Samples were injected onto a C18 column 
(25 cm × 75 µm I.D. packed with ReproSil Pur 
C18 AQ 1.9-µm particles) in 0.1% formic acid
and then separated with a 60-min gradient 
from 5 to 40% buffer B (90% ACN/10% water/ 
0.1% formic acid) at a flow rate of 300 nl/min. 
SRM acquisition was performed operating Q1 
and Q3 at 0.7-unit-mass resolution. For each
peptide, the best five transitions were moni- 
tored in a scheduled fashion with a retention 
time window of 5 min and a cycle time fixed to 
2 s. Argon was used as the collision gas at a 
nominal pressure of 1.5 mTorr. Collision ener- 
gies were calculated as CE = 0.0348 × (m/z) + 
0.4551 and CE = 0.0271 × (m/z) + 1.5910 (CE, 
collision energy; m/z, mass-to-charge ratio) 
for doubly and triply charged precursor ions, 
respectively. SRM data were processed using
Skyline (133). Protein significance analysis was 
performed using MSstats (134). Normalization 
of the intensities across samples was performed 
using the acetylated peptides H3K9_H3K14 
(peptide containing both acetylation sites), 
H3K23 and H3K14 as global standards, which 
did not show any change across the mutants. 
Log2 fold changes were calculated from three
independent runs and plotted (fig. S7C and 
table S6). 
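
The two collision-energy equations quoted in this section (one for doubly charged and one for triply charged precursor ions) amount to a small lookup on the charge state. A minimal sketch, in which the function name is hypothetical and the coefficients are those given in the text:

```python
def collision_energy(mz, charge):
    """Collision energy from the precursor m/z, using the linear equations quoted in the text.

    Only doubly and triply charged precursors are covered, as in the source.
    """
    if charge == 2:
        return 0.0348 * mz + 0.4551
    if charge == 3:
        return 0.0271 * mz + 1.5910
    raise ValueError("equations are given only for charge states 2+ and 3+")


# Example precursors (illustrative m/z values, not taken from the study).
ce_doubly = collision_energy(500.0, 2)
ce_triply = collision_energy(600.0, 3)
```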


Cryptic transcription—qPCR 


Total RNA was extracted from 10 OD600 units 
of mid-log phase cells (WT and respective 
mutant strains) using the hot acid phenol-chloroform 
extraction method as described. Ten µg of total 
RNA was DNase I treated (Promega) followed 
by purification using an RNeasy minikit (Qiagen). 
One µg of DNase I-treated total RNA was used 
to synthesize cDNA using the SuperScript III first 
strand synthesis system (Life Technologies) 
and random hexamer primers. cDNA was diluted 
1:25 before amplification by PCR using primers 
designed for the 5’ and the 3’ ends of the STE11 
gene. qPCR was performed using SYBR green 
(Biorad) as described previously (135). Relative 
changes in transcript levels were estimated 
using the ΔΔCt method described in (136) and 
were normalized to the ACT1 transcript (table S).
Primer sequences are available upon request. 


Western blotting 


Whole-yeast-cell lysates were prepared using 
trichloroacetic acid (TCA) lysis as described 
previously (137). Lysates were subjected to im- 
munoblotting according to standard proce- 
dures, and proteins were detected using ECL 
Prime (Amersham Biosciences). Membranes 
were probed with an α-H3K36me3 antibody 
purchased from Abcam (catalog no. 9050).
Glyceraldehyde-phosphate dehydrogenase 
(GAPDH) was used as a loading control and 
detected using an antibody purchased from 
Sigma (catalog no. A9521). 


ATAC-seq 


Yeast cells (2.5 x 10°) were grown to mid-log 
phase, pelleted, washed with SB-buffer (1.4 M 
Sorbitol, 40 mM HEPES-KOH pH 7.5, 0.6 mM 
MgCl2), resuspended in 200 µl SB buffer + 
10 mM DTT with 10 µl of 10 mg/ml 100T 
zymolyase (MP Biomedicals) solution and 
incubated for 5 min at 30°C. Spheroplasted 
cells were washed with SB-buffer and incu- 
bated for 15 min at 37°C in 25 µl of transposase 
solution (12.5 µl 2x TD buffer, 1.25 µl Nextera 
enzyme, 11.25 µl water). DNA was purified
(Qiagen MinElute DNA Purification Kit), am- 
plified, and barcoded by PCR. Purified PCR 
products were sequenced using an Illumina 
HiSeq 4000 sequencer. Sequence reads were 
trimmed and aligned to the genome of Saccha- 
romyces cerevisiae (version SacCer3 from 
hgdownload.cse.ucsc.edu/downloads.html), 
and reads with a length <100 basepairs (bp) 
were removed. Replicates belonging to an allele 
(WT, H3K36A, H3K122A, set2Δ) were merged
and normalized to the smallest read number. 
For visualization of STE11 read coverage using
the IGV genome browser (138) (fig. S7G), each 
track was scaled linearly so that the largest 
peak in the displayed window is the same height 
for all tracks. Count files were generated with 
“featureCounts v1.5.3” (139). 

Gene body plots (fig. S7H) were generated 
as follows: First, counts from genes reported 
to be targets of cryptic transcription (n = 11; 
FLO8, AVO1, LCB5, SMC3, SPB4, APM2, DDC1, 
SYF1, OMS1, PUS4, STE11), as well as counts
400 bp up- and downstream of the respective 
gene bodies, were extracted. Second, up- and 
downstream regions were split into 50 bins of 
equal size (8 bp), whereas the gene body was 
split into 300 equal bins, resulting in 400 bins 
for each gene in each tested strain (WT, set2Δ,
H3K36A and H3K122A). Third, for each of the 
400 bins, the average for the 11 target genes 
was calculated. Fourth, each mutant allele was 
then scaled linearly so that the first bin (i.e.,
400 bp upstream of the gene body start) was 
equal to that of WT. Finally, the WT counts 
were subtracted from the mutant counts for 
each bin. 
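
The five steps above can be sketched as follows. This is an illustration under assumptions rather than the authors' script: the function names are hypothetical, each gene is represented by a per-base coverage list that already includes the flanking regions, and each bin is summarized by its mean coverage (the text does not state whether counts were summed or averaged within a bin).

```python
def bin_means(values, n_bins):
    """Split values into n_bins nearly equal consecutive bins and return each bin's mean."""
    n = len(values)
    out = []
    for b in range(n_bins):
        start = b * n // n_bins
        end = max((b + 1) * n // n_bins, start + 1)  # never an empty bin
        chunk = values[start:end]
        out.append(sum(chunk) / len(chunk))
    return out


def gene_profile(coverage, flank=400, flank_bins=50, body_bins=300):
    """Upstream, gene-body and downstream bins for one gene (50 + 300 + 50 by default)."""
    up, body, down = coverage[:flank], coverage[flank:-flank], coverage[-flank:]
    return bin_means(up, flank_bins) + bin_means(body, body_bins) + bin_means(down, flank_bins)


def average_profile(profiles):
    """Bin-wise average over the profiles of all target genes."""
    return [sum(col) / len(col) for col in zip(*profiles)]


def difference_to_wt(mutant_avg, wt_avg):
    """Scale the mutant so its first bin equals WT, then subtract WT bin by bin.

    Assumes the first mutant bin is non-zero.
    """
    scale = wt_avg[0] / mutant_avg[0]
    return [m * scale - w for m, w in zip(mutant_avg, wt_avg)]


# Toy example with small sizes: 4-bp flanks in 2 bins and a 6-bp gene body in 3 bins.
wt_cov = [2] * 4 + [4] * 6 + [2] * 4
mut_cov = [1] * 4 + [4] * 6 + [1] * 4
wt_avg = average_profile([gene_profile(wt_cov, flank=4, flank_bins=2, body_bins=3)])
mut_avg = average_profile([gene_profile(mut_cov, flank=4, flank_bins=2, body_bins=3)])
diff = difference_to_wt(mut_avg, wt_avg)
```

Because the gene body is always divided into the same number of bins, genes of different lengths can be averaged position by position.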

The enhanced vegetation productivity driven by increased concentrations of carbon dioxide (CO2) [i.e., the CO2 fertilization effect (CFE)] sustains an important negative feedback on climate warming, but the temporal dynamics of CFE remain unclear. Using multiple long-term satellite- and ground-based datasets, we showed that global CFE has declined across most terrestrial regions of the globe from 1982 to 2015, correlating well with changing nutrient concentrations and availability of soil water. Current carbon cycle models also demonstrate a declining CFE trend, albeit one substantially weaker than that from the global observations. This declining trend in the forcing of terrestrial carbon sinks by increasing amounts of atmospheric CO2 implies a weakening negative feedback on the climatic system and increased societal dependence on future strategies to mitigate climate warming.


Terrestrial ecosystems have accounted for
more than half of the global carbon sink 
during the last six decades and have thus 
substantially mitigated climate warming 
(1). Global process-based models attri- 
bute part of the increasing land carbon sink 
(2) to the increase in vegetation productivity 
driven by the fertilization effect of increasing 
atmospheric CO2 concentration (3), i.e., the 
CO2 fertilization effect (CFE), a process that 
acts as a negative feedback in the climate 
system (4). First introduced by Keeling (5), 
the β factor generally is used to characterize 
the plant response to increasing CO2 concen- 
tration. To compare the results between global 
analysis and experimental measurements, we 
used an approximation form of the original β 
(β = dGPP/dCa) (6), which is defined as the relative 
increase in gross primary production (GPP) 
in response to a 100-ppm increase in atmo- 
spheric CO2 concentration (Ca).

Increases of GPP originate from the direct 
acceleration of photosynthesis in response to 
the increased supply of CO2 (7), but they are 
also modified by a suite of indirect responses, 
including water saving because of the reduced 
stomatal conductance under increasing atmo- 
spheric CO2 concentrations (8, 9) and nutrient 
limitation (10). Enhanced GPP and net prima- 
ry production (NPP) are commonly observed 
in field experiments, such as free-air CO2 en- 
richment (FACE) experiments (7) and open-top 
chamber experiments (11), where ecosystems 
are exposed to elevated CO2 in the range of 
two times ambient values, albeit with different 
response magnitudes across sites (7).

The fingerprint of CFE on the carbon and 
water cycles on global scales (3, 8) is, however, 
more elusive, given the covariation of atmo- 
spheric CO2 with other environmental drivers
of vegetation productivity. Elucidating this 
fingerprint of CFE is a scientific problem in- 
volving detection and attribution methods, 
requiring either statistical methods and long- 
term data such that effects of non-CFE drivers 
can be removed empirically (12, 13), or process-based 
models with which CFE can be isolated 
by deliberate factorial simulations (14).

Process-based models of the terrestrial car- 
bon cycle have indicated that CFE accounts for 
~70% of the increasing global trend in foliar 
area, i.e., global greening (15), and up to 60% of 
the current terrestrial carbon sink (3). These 
models project that CFE will induce increased 
land carbon storage by the end of this century, 
despite opposing effects from climate change 
(3, 16). A cascade of uncertainties in projecting 
future land carbon storage arises from the 
responses of photosynthesis, NPP, and ecosys- 
tem carbon turnover times to increasing CO2,
especially in the presence of increasing limi- 
tations from nutrient and water availability. 
Therefore, accurately quantifying the tempo- 
ral dynamics of CFE on GPP is essential for 
reducing the uncertainties of future land car- 
bon storage and climate projections derived 
on the basis of Earth system models (4). 

A study using the long-term CO2 concen- 
tration measurements at the Barrow station 
(Alaska, 71°N) found that the sensitivity of 
the seasonal amplitude of atmospheric CO2 
to the increase in CO2 concentration has de- 
creased, which suggests a declining response 
of GPP to CO2 at northern high latitudes (17).
If such declining trends prevail across the 
globe, the terrestrial vegetation carbon sink 
response to increasing CO2 will decrease, with
important consequences for the global carbon 
budget and for the effort required from miti- 
gation policies to meet future climate targets. 
For this reason, a comprehensive assessment 
of the temporal dynamics of CFE on global 
GPP is timely as a first step to understand the 
impacts of CFE on the trends of the land car- 
bon sinks. We used three long-term satellite 
datasets collected during 1982 to 2015: a re- 
cently developed vegetation index (NIRv) (18) 
from the Advanced Very High Resolution Radio- 
meter (AVHRR NIRv), the fusion of NIRv from 
AVHRR and the Moderate Resolution Imag- 
ing Spectroradiometer (AVHRR+MODIS NIRv), 
and the fusion of NIRv and sun-induced 
chlorophyll fluorescence (SIF) (19) (NIRv+SIF), 
which are proxies for GPP (hereafter, satellite 
GPP proxies) (supplementary text S1 and figs. 
S1 to S5) (20). We also corroborated our find- 
ings using long-term GPP time series from 
eddy covariance (EC) flux towers, from a light 
use efficiency (LUE) model (21), and from an 
ensemble of terrestrial carbon cycle models 
(TRENDY v6, trends and drivers of the re- 
gional scale sources and sinks of carbon di- 
oxide) (14).


Global temporal trend of β from observations
and models 


Linear and nonlinear multiple regression approaches were used to estimate β from satellite GPP proxies at each pixel across the globe (12, 13, 20). Pixels with large land-cover changes were excluded from the analyses (supplementary text S2 and fig. S6). The global median β during 1982 to 2015 was 16.1 ± 11.5% 100 ppm⁻¹ (fig. S7), consistent with the FACE experiments (7), which on average suggested a 15.5% increase in light-saturated photosynthesis per 100 ppm CO2. Results showed that the estimated β values for different vegetation physiology and biome types were also aligned with experimental results (7), e.g., the β values of C4 plants were much smaller than those of C3 plants (supplementary text S3 and fig. S8). We next calculated the time series of β with 15-year moving windows during 1982 to 2015 and found that β significantly decreased at a rate of −0.92 ± 0.12% 100 ppm⁻¹ year⁻¹ (p < 0.01) (Fig. 1A). This decrease was also evident after excluding the crop areas (fig. S9). Compared with the global median of the declining trend in β, grasslands and plants in cold climate zones exhibited a larger declining trend of β, whereas the β trends of shrubs and plants in tropical areas were slightly lower; nevertheless, the decreases in β across all the climate zones and vegetation types were notable (fig. S10). Across the global terrestrial areas, β significantly declined from 21.8 ± 13.1% 100 ppm⁻¹ during 1982 to 1996 to 12.9 ± 11.6% 100 ppm⁻¹ during 2001 to 2015 (Fig. 1B).

Fig. 1. Declining trend of global β. (A) Temporal dynamics of β for three satellite GPP proxies with 15-year moving windows during 1982 to 2015. The gray area indicates 1 SD on either side of the mean (n = 12,850). The trend and statistical significance (p value) of the β time series were estimated using the Mann-Kendall test. (B) (Left) The histogram distribution of β across all pixels in two 15-year periods. β was the average of these three satellite GPP proxies. (Right) Boxes represent the interquartile ranges of the β values (solid lines represent medians), and whiskers extend to one times the interquartile range. Median β values for these two periods and their SDs are shown at the top of the graphs. The asterisk indicates a significantly different β between these two periods, on the basis of a two-sample Kolmogorov-Smirnov test at p < 0.01.

Fig. 2. Global declining trend of β. Spatial patterns of trends in β derived from three datasets using 15-year moving windows, reported as the percentage per 100 ppm per year. (A) Means from three satellite GPP proxies. (B) The revised EC-LUE GPP. (C) Multimodel mean GPP determined from the TRENDY v6 ensemble. All datasets are from 1982 to 2015. The Mann-Kendall test was used to estimate the trend of β pixel by pixel, and the regions with black dots indicate significant trends (p < 0.05). The pixel size is 1°. (D) Mean declining trend of β from different datasets. The error bars represent SEs.
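
The moving-window analysis of β described in this section can be illustrated with a deliberately simplified sketch. It is not the regression used in the study, which includes additional explanatory variables: here β in each window is taken from a single-variable least-squares fit of a GPP proxy on CO2, expressed as a percentage of the window-mean GPP per 100 ppm (an assumption about the normalization). The trend of the resulting β series is then assessed with a Mann-Kendall test (normal approximation, no tie correction) and a Sen's slope. All function names and the synthetic data are hypothetical.

```python
import math
from statistics import median


def ols_slope(x, y):
    """Least-squares slope of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx


def moving_window_beta(co2, gpp, window=15):
    """Beta (% GPP change per 100 ppm CO2) for each consecutive window of years."""
    betas = []
    for start in range(len(co2) - window + 1):
        c = co2[start:start + window]
        g = gpp[start:start + window]
        mean_gpp = sum(g) / window
        # Absolute sensitivity -> relative change per 100 ppm, as a percentage.
        betas.append(100.0 * ols_slope(c, g) / mean_gpp * 100.0)
    return betas


def mann_kendall(series):
    """Mann-Kendall S statistic and two-sided p value (normal approximation, no ties)."""
    n = len(series)
    s = sum(
        (series[j] > series[i]) - (series[j] < series[i])
        for i in range(n - 1)
        for j in range(i + 1, n)
    )
    var = n * (n - 1) * (2 * n + 5) / 18.0
    if s > 0:
        z = (s - 1) / math.sqrt(var)
    elif s < 0:
        z = (s + 1) / math.sqrt(var)
    else:
        z = 0.0
    return s, math.erfc(abs(z) / math.sqrt(2))


def sen_slope(series):
    """Median of all pairwise slopes (Theil-Sen estimator) for an evenly spaced series."""
    n = len(series)
    return median(
        (series[j] - series[i]) / (j - i) for i in range(n - 1) for j in range(i + 1, n)
    )


# Synthetic example: GPP rises by a fixed absolute amount per ppm CO2,
# so the relative response (beta) shrinks as mean GPP grows.
co2 = [340 + 2 * i for i in range(20)]
gpp = [100 + 0.1 * (c - 340) for c in co2]
betas = moving_window_beta(co2, gpp)
s_stat, p_value = mann_kendall(betas)
trend = sen_slope(betas)
```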

We also found a robust decrease in β when we used another satellite NIRv time series (fig. S11), various land cover change masks (fig. S12), different window lengths (supplementary text S4 and fig. S13), various combinations of explanatory variables (supplementary text S5 and figs. S14 and S15), and different definitions of growing seasons (fig. S16) and when we included the consideration of seasonal precipitation (supplementary text S6 and fig. S17). After considering the uncertainties from original satellite data and from the methods, the global β estimated from satellite GPP proxies during 2001 to 2015 was still significantly lower than that during 1982 to 1996 (supplementary text S7 and figs. S18 and S19). To verify whether the trend of β was an artefact of our regression method, we also estimated the temporal dynamics of β using the optimal fingerprint attribution method (supplementary text S8) and again found that the declining trend of β was significant (fig. S20). Altogether, these results based on satellite observations of GPP proxies suggest a significant declining trend in the response of GPP to increasing atmospheric CO2.

To further assess the decreases in β inferred from satellite GPP proxies, we used other independent datasets: (i) the GPP time series derived from the revised EC-LUE model (21) (which also accounted for the direct CO2 effects on LUE); (ii) satellite-based leaf area index (LAI) time series; (iii) multiyear GPP estimations from 22 EC flux sites (table S4); and (iv) a global long-term GPP dataset from the Carbon Cycle Data Assimilation System (CCDAS) (supplementary text S9 to S12). We also found a significant declining trend of β on the basis of the revised EC-LUE GPP, with a rate of −0.82 ± 0.18% 100 ppm⁻¹ year⁻¹ (fig. S21), similar to the estimates from satellite GPP proxies. However, the result from satellite-based LAI showed a smaller declining rate of β (−0.59 ± 0.11% 100 ppm⁻¹ year⁻¹) (fig. S22). This divergence suggested that the decrease in β was likely due to two factors: the direct effect on foliar physiology and the indirect effect on LAI. The former refers to the CO2-induced stimulation of carboxylation per unit leaf area, whereas the latter depicts the increased carbon sequestration used for leaf area expansion. The smaller declining β trend from satellite LAI suggested that these two effects were both relevant for the full description of the global decreases in the CO2 effect on GPP. Ground GPP estimates from EC flux sites also confirmed our findings, by indicating an average declining rate of β at −0.70% 100 ppm⁻¹ year⁻¹, which was comparable to the results from satellite GPP proxies around these sites (fig. S23). On the basis of the CCDAS GPP, a data-model fusion product constrained by atmospheric CO2 observations and which is independent from satellite data, we also found a declining β trend at a rate of −0.62% 100 ppm⁻¹ year⁻¹ (fig. S24). The overall consistency between various remote-sensing data, ground measurements, and a carbon assimilation system suggests that the global decline of β is robust and coherent across multiple, independent observations.

Fig. 3. Relationship between the residual trend of β and foliar nutrient concentrations. Results for foliar N (A) and foliar P (B) concentrations after accounting for site mean annual air temperature and mean annual precipitation from 3846 samples on the basis of a spatial mixed-effects model. Model performance and results are presented in table S5. Data are classified into 50 bins for clear visualization on the basis of foliar N or P concentrations. The red dots represent the means for each bin, and the blue lines represent the SDs of the means. The gray dots represent the raw individual samples, and black lines represent the linear regressions of these gray dots. The correlation coefficients (r) and p values were calculated on the basis of the raw data (n = 3846).


Spatial pattern of the temporal trend of β 
across the globe 


The geographic distribution of the temporal trends of β from satellite GPP proxies (Fig. 2A) revealed that β decreased in ~86% of global lands (fig. S25, A to C). Areas with declining β spanned most of the globe. By contrast, increasing β was found in some limited regions of Southeast Asia, east Australia, and North America (Fig. 2, A and B). These increasing β trends are likely driven by an intensification of management in croplands (i.e., irrigation and fertilization) or related to increasing atmospheric nutrient deposition in recent years (fig. S33). We also observed similar spatial patterns for β trends calculated using both 10-year and 17-year moving windows (fig. S26). Overall, the decreases of β in tropical areas were smaller, whereas cold regions had a slightly stronger declining trend (figs. S10A and S25, A to C). The latter may be because the declining signals of β in northern high latitudes include both the indirect effects on LAI and the direct effect on foliar physiology, whereas those in warmer climates include only the physiological effect, given that LAI in some tropical humid regions may already be close to saturation. Results of the satellite LAI confirmed this interpretation by suggesting a larger reduction of β in colder northern regions than in warmer climates (fig. S25, E and F). Similar declines in β were obtained from the revised EC-LUE GPP, which covered ~74% of the global terrestrial area (Fig. 2B). In general, all datasets indicated that β has been declining in most global lands and across various vegetation types and climate zones.

We also investigated whether an ensemble of state-of-the-art carbon cycle models (14) was able to reproduce these observed global declines of β on GPP. We used results from 12 models that contributed to the TRENDY v6 ensemble to calculate β (20) (supplementary text S13). For each model, we extracted the differences in simulated GPP between S1 (time-varying CO2 only) and S0 (constant CO2) scenarios, which represented the CO2 effect on vegetation productivity. These models predicted a negative trend in β, both for the multimodel mean and for the individual models, but the declining rate was clearly lower than that derived from satellite GPP proxies and with no evident spatial variations (Fig. 2C and fig. S27). Grouping the estimates of CFE trends, we found that the global β declined at a rate of −0.92 ± 0.12% 100 ppm⁻¹ year⁻¹ for satellite GPP proxies and only −0.12 ± 0.01% 100 ppm⁻¹ year⁻¹ for TRENDY GPP (Fig. 2D). β trends of individual models were variable, from −0.06 to −0.21% 100 ppm⁻¹ year⁻¹ (fig. S28), but all of them were lower than that from satellite GPP proxies. Given that we defined β as the percent increase in GPP per 100-ppm increase in CO2 and that the S1 simulations solely considered the CO2 effect, the smaller decreases in β from TRENDY GPP were likely caused by the saturating physiological response of GPP to CO2 (22), without adequately capturing the concurrent emergence of other limiting factors driven by the changing environmental conditions. We also used GPP from the TRENDY S2 and S3 scenarios to estimate the trends in β by using the regression approaches (20). The temporal trends of β remained unaltered, with a value of approximately −0.12 ± 0.12% 100 ppm⁻¹ year⁻¹ for both TRENDY S2 and S3 simulations (fig. S29). These results highlight that the ongoing strong decrease in β inferred from satellite datasets is probably underestimated by TRENDY models.


Possible mechanisms accounting for the 
declining β


Two possible non-mutually exclusive hypotheses were proposed to account for the declining β and to explain why TRENDY models failed to adequately replicate the magnitude of this decline: (i) the increasing constraints on vegetation productivity from emerging nutrient limitations and imbalances that were not adequately represented in models and (ii) current models underestimated the sensitivity of terrestrial GPP to changes in water availability.

The first hypothesis relates to the possible 
effect of the growing limitation of key nu- 
trients, including nitrogen (N) and phosphorus 
(P). Using 410 groups of ground-based foliar 
N and P observations (supplementary text 
S14 and fig. S30), we found a general decrease 
in foliar N and P concentrations, with mean 
values of -0.24 + 0.06% year™ and -0.55 + 
0.06% year‘, respectively (fig. S31, A and B). 
Our findings are consistent with a recent study 
suggesting a general global pattern of decreas- 
ing foliar N concentration (23) and with many 
examples of local to regional decreases in fo- 
liar N and P concentrations (24, 25). Enhanced 
GPP from the increasing atmospheric CO, 
concentrations, leading to larger NPP and 
higher nutrient demands by plants, may partly 
explain the observed declines of foliar N and P 
concentrations. Concurrently, these decreases 
in key foliar nutrients may impose limitations 
on GPP and thereby limit β. To test this hypothesis,
we applied a linear spatial mixed-effects
model (table S5) to investigate the
relationship between the trend of β and foliar
N or P concentrations (supplementary text S14
and fig. S32). After accounting for the trends
in β explained statistically by mean annual air
temperature and mean annual precipitation,
we found clear positive correlations of
the residual declining trends of β with both
foliar N and P concentrations across European
forests (Fig. 3). These results suggested that
vegetation with lower foliar nutrient concentrations
generally showed larger declines in β,
therefore supporting our hypothesis of a role
for nutrient limitation in the temporal dynamics
of β.
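
The residual analysis described above can be illustrated with a much simpler stand-in. The sketch below is not the spatial mixed-effects model of table S5: it assumes plain ordinary least squares, and all site values are synthetic and hypothetical. It first removes the part of the β trend explained by mean annual temperature and precipitation, then relates the remaining trend to a foliar nutrient concentration.

```python
import numpy as np

def residual_nutrient_relation(beta_trend, temperature, precipitation, foliar_nutrient):
    """Remove the climate-explained part of the beta trend, then relate the rest to a foliar nutrient."""
    beta_trend = np.asarray(beta_trend, dtype=float)
    foliar_nutrient = np.asarray(foliar_nutrient, dtype=float)
    # Ordinary least squares on intercept, temperature, and precipitation.
    design = np.column_stack([np.ones_like(beta_trend), temperature, precipitation])
    coefficients, *_ = np.linalg.lstsq(design, beta_trend, rcond=None)
    residual = beta_trend - design @ coefficients
    # Slope and Pearson correlation of the residual trend against the nutrient.
    slope = np.polyfit(foliar_nutrient, residual, 1)[0]
    correlation = np.corrcoef(foliar_nutrient, residual)[0, 1]
    return slope, correlation

# Synthetic sites (hypothetical numbers, not data from the study).
rng = np.random.default_rng(0)
n_sites = 200
temperature = rng.normal(8.0, 3.0, n_sites)
precipitation = rng.normal(800.0, 150.0, n_sites)
foliar_n = rng.normal(1.5, 0.3, n_sites)
beta_trend = (-2.0 + 0.02 * temperature + 0.0005 * precipitation
              + 0.8 * foliar_n + rng.normal(0.0, 0.05, n_sites))

slope, correlation = residual_nutrient_relation(beta_trend, temperature, precipitation, foliar_n)
print(f"residual slope = {slope:.2f}, r = {correlation:.2f}")
```

A positive slope here corresponds to the pattern reported in the text: sites with lower foliar nutrient concentrations show more negative residual trends. A mixed-effects model would additionally account for spatial grouping, which this sketch ignores.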

This phenomenon was also verified from 
global trends in atmospheric N and P deposi- 
tions, which suggested that areas with decreas- 
ing atmospheric nutrient supplies probably 
had larger decreases in β (such as Europe and
Siberia) and vice versa (such as East Asia) (fig.
S33). This may be because vegetation in areas
with declining nutrient supplies from atmospheric
depositions tends to have larger N and
P limitations on GPP. Moreover, the increases
of N and P depositions in some regions of East
Asia may explain the increasing trend of β in
these areas (Fig. 2A and fig. S33). The ongoing
decreases in foliar nutrients might constrain
the plant photosynthetic capacity and result in
the decline of β, which might not have been
adequately represented in current models (supplementary
text S14). Regarding this aspect,
the models that included C-N cycle interactions
to emulate nutrient constraints exhibited a
larger declining rate of β (fig. S34), partly confirming
this interpretation. The role of N limitation
on β has been widely suggested by
experimental evidence (10, 26, 27), model
analyses (28), and synthesis reviews (29, 30), 
all of which have been consistent with our 
analyses. Our finding regarding P limitation 
was also consistent with FACE experiments, 
which demonstrated that insufficient P availability
generally had negative impacts on β
(31). Furthermore, as foliar P:N ratios positively
correlated with plant net photosynthesis
and growth (32), the decreasing foliar P:N
ratios in European forests (fig. S31C) suggested
a worsening nutrient imbalance that may partly
account for the observed decline in β.

The second hypothesis to explain the divergence
of the β trend between the TRENDY
models and the satellite-derived estimates stated
that these models underestimated the sensitivity
of GPP to water availability, because the
coupling between water and carbon in models
is underestimated (33). To test this hypothesis,
we analyzed the sensitivity of satellite GPP
proxies and TRENDY GPP to water availability
using a moving window of 15 years (supplementary
text S15). We used the terrestrial water
storage (TWS) data to represent the availability
of water to plants and selected arid, temperate,
and dry tropical climate zones as the research
areas (fig. S35), as the GPP in these zones was
found to be highly sensitive to TWS (fig. S36).
We observed that the sensitivity of GPP to TWS
was relatively constant across these three zones
for the multimodel mean of the TRENDY GPP,
whereas the TWS sensitivities derived from satellite
GPP proxies exhibited significantly increasing
trends for arid (2.93 ± 0.38% year⁻¹),
temperate (2.12 ± 0.39% year⁻¹), and dry tropical
(2.30 ± 0.57% year⁻¹) ecosystems (Fig. 4).
In temperate and dry tropical areas, the majority
of models exhibited divergent results
compared to satellite GPP proxies, and almost
all of them largely underestimated the GPP
sensitivities to TWS in arid areas (fig. S37). The
decreases in β were larger in the regions where
the increases of TWS sensitivities were higher
(fig. S38). These strongly divergent sensitivities
between satellite GPP proxies and TRENDY
GPP were also confirmed when using various
TWS products (20) (supplementary text S15
and fig. S39) and when using a shorter moving
window (fig. S40).

Fig. 4. Sensitivities of GPP to TWS. The changes in satellite
GPP proxies or TRENDY GPP per unit change in TWS were estimated
for arid (A), temperate (B), and dry tropical (C) climate
zones using a moving window of 15 years. The sensitivity time
series have been standardized. The solid lines in the left panels
represent the linear regressions. The shaded areas represent
the SEs. The bars and error bars in the right panels represent
the slopes and their SEs, respectively.
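
The moving-window analysis can be sketched as follows. This is a minimal illustration and not the procedure of supplementary text S15: it assumes that "sensitivity" is the ordinary least-squares slope of GPP against TWS within each 15-year window, and the series are synthetic, with a hypothetical step increase in sensitivity halfway through the record.

```python
import numpy as np

def moving_window_sensitivity(gpp, tws, window=15):
    """Return the regression slope of GPP on TWS for each consecutive window."""
    gpp = np.asarray(gpp, dtype=float)
    tws = np.asarray(tws, dtype=float)
    slopes = []
    for start in range(len(gpp) - window + 1):
        stop = start + window
        # Slope of a first-order polynomial fit: change in GPP per unit change in TWS.
        slopes.append(np.polyfit(tws[start:stop], gpp[start:stop], 1)[0])
    return np.array(slopes)

# Synthetic 30-year record (hypothetical values, noise-free for clarity).
rng = np.random.default_rng(1)
n_years = 30
tws = rng.normal(0.0, 1.0, n_years)
true_sensitivity = np.where(np.arange(n_years) < 15, 0.5, 1.5)
gpp = true_sensitivity * tws

sensitivity = moving_window_sensitivity(gpp, tws, window=15)
# Standardize the sensitivity series, as done for Fig. 4, before fitting a trend.
standardized = (sensitivity - sensitivity.mean()) / sensitivity.std()
trend_per_window = np.polyfit(np.arange(sensitivity.size), standardized, 1)[0]
print(sensitivity[0], sensitivity[-1], trend_per_window)
```

The first and last windows lie entirely within one sensitivity regime, so they recover the imposed slopes; the windows in between mix both regimes.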

This finding implies that GPP and conse- 
quently land carbon uptake are more sen- 
sitive to the variations in water availability 
than assumed by the TRENDY models, as 
previously suggested by several recent studies 
(34, 35). Moreover, a recent study showed 
that the CFE of grasslands could be reduced 
under drier conditions (36). The relationship 
between drought stress and CFE is complex 
and ecosystem specific and may be affected 
by the total annual rainfall as well as rain- 
fall seasonality. It was shown, for instance, 
in 19 grassland experiments that the annual
CFE was negated when spring precipitation 
became too low (37). Field experiments also 
support our results by suggesting that the 
CFE on vegetation productivity is at least part- 
ly limited by water availability (38). The sig- 
nificant increases in the GPP sensitivities to 
TWS may thus partly explain the decreases 
of β in arid, temperate, and dry tropical climate
zones.

From a theoretical perspective, the global
declines of CFE may result from several factors.
First, given that the CO₂-induced photosynthesis
stimulation at the leaf level is scaled
up to the canopy level through LAI (39, 40),
the declining β on the basis of satellite LAI (fig.
S22) could partly explain the global decreases
of the GPP response to CO₂. Consistent with
our findings, a recent study using the FACE
experiments on mature forests found a relatively
low CO₂ effect on GPP (41), possibly because
the LAI of these forests did not change
much (39). Second, the CO₂ effect on photosynthesis
at the leaf level involves both the
stimulation of carboxylation and increases in
water use efficiency (WUE). The former may
be regulated by the key foliar nutrients
(N and P), and the latter is related to
the water availability. According to the progressive
N limitation theory, the soil N availability
for plant growth may be expected to
diminish over time (42), possibly leading to
the observed global decreases in β. A recent
study using carbon isotope measurements revealed
diminishing CO₂-induced WUE gains
across global forests (43), which supported
our findings of the effects of water supply
limitation on CFE and could partly explain the
global declines in β. Moreover, complex interactive
effects between nutrients and water
supply may also have impacts on CFE. For 
example, low soil water supply could possi- 
bly strengthen the nutrient constraints on 
CFE through limiting nutrient decomposition 
and diffusion in soils (38). Our further analysis 
using model simulations showed that the GPP 
trends have clear reductions in arid areas, 
once the interactions between N limitation 
and climate constraints are considered, sup- 
porting this hypothesis (fig. S41B and sup- 
plementary text S16). Nevertheless, an excess 
of N fertilizer may result in a reduction in 
the soil water and therefore possibly lead to 
drought stress (44). Regarding this aspect, 
our analysis also highlights the need for future 
efforts to better understand the complex interactions
between nutrients, water, and CO₂ effects
on vegetation from a climate change
perspective. Additional mechanisms (e.g., plant 
acclimation or changes in plant species over 
time) may also explain the observed global de- 
crease in CFE (supplementary text S17). 


Conclusion 


Our analyses showed a significant and spatially
extensive decline in β, which implies a
substantial reduction of the positive effects of
increasing atmospheric CO₂ on terrestrial carbon
uptake. A recent study suggested that the
CO₂ effect on the carbon cycle in tropical regions
(3) might be counteracted by impacts
from climate-driven changes (45), in agreement
with our findings. Although still under debate,
the possible increasing trend of the airborne
fraction of anthropogenic CO₂ may imply a
saturation of the CO₂ sinks from land and
oceans (46-49), which may be partly caused
by the global decline in CFE. Current carbon
cycle models also exhibit such global decreases
in β but fail to adequately reproduce the sharp
declining trend that we identified from satellite
data. This divergence between observations
and process-based models possibly originates
from the models' limitations in adequately
representing the emerging decline in key foliar
nutrient concentrations and the increasing
constraints of water limitations on vegetation
productivity. Ultimately, these results indicate
that terrestrial photosynthesis may not increase
as much as models project, possibly
reducing the potential of land-based climate
mitigation, further accelerating global warming,
and increasing the efforts required for
meeting climate targets. Our findings also highlight
the need for better characterizations of
the biogeochemical and hydrological effects
on vegetation in current carbon cycle models
to produce more robust projections of the terrestrial
carbon budget for the next decades.


Monolithic perovskite/silicon tandem solar cell with
>29% efficiency by enhanced hole extraction


Amran Al-Ashouri, Eike Köhnen, Bor Li, Artiom Magomedov, Hannes Hempel,
Pietro Caprioglio, José A. Márquez, Anna Belen Morales Vilches, Ernestas Kasparavicius,
Joel A. Smith, Nga Phung, Dorothee Menzel, Max Grischek, Lukas Kegelmann, Dieter Skroblin,
Dieter Neher, Martin Stolterfoht, Thomas Unold, Vytautas Getautis, Steve Albrecht


Tandem solar cells that pair silicon with a metal halide perovskite are a promising option for surpassing 
the single-cell efficiency limit. We report a monolithic perovskite/silicon tandem with a certified power 
conversion efficiency of 29.15%. The perovskite absorber, with a bandgap of 1.68 electron volts, 
remained phase-stable under illumination through a combination of fast hole extraction and minimized 
nonradiative recombination at the hole-selective interface. These features were made possible by a 
self-assembled, methyl-substituted carbazole monolayer as the hole-selective layer in the perovskite cell. 
The accelerated hole extraction was linked to a low ideality factor of 1.26 and single-junction fill factors 
of up to 84%, while enabling a tandem open-circuit voltage of as high as 1.92 volts. In air, without 
encapsulation, a tandem retained 95% of its initial efficiency after 300 hours of operation. 


A tandem solar cell, consisting of a silicon
cell overlaid by a perovskite solar cell
(PSC) (1), could increase efficiencies of
commercial mass-produced photovoltaics
beyond the single-junction cell
limit (1, 2) without adding substantial cost 
(3, 4). The certified power conversion effi- 
ciency (PCE) of PSCs has reached up to 25.5% 
for single-junction solar cells (usual active area 
of ~0.1 cm²) (5), 24.2% for perovskite/CIGSe
(copper-indium-gallium-selenide) tandem cells
(~1 cm²) (5-7), 24.8% for all-perovskite tandem
cells (0.05 cm²) (8, 9), and 26.2% for the
highest openly published perovskite/silicon
tandem efficiency (~1 cm²) (10). Perovskite/silicon
tandem cells have additionally undergone
technological advances in both stability
and compatibility with textured silicon sub- 
strates (11-13). However, these perovskite- 
based tandem solar cells still have room for 
improvement, as practical limits for all these
tandem technologies are well above 30%
(14, 15). 

The increase in PSC efficiency has been 
driven in part by advances in physical and 
chemical understanding of the defect and re- 
combination mechanisms. Some reports pre- 
sented near-perfect passivation of surfaces and 
grain boundaries, with photoluminescence 
quantum yields (PLQYs) approaching theoretical
limits (16-18). Consequently, PSCs were
reported with open-circuit voltage (Voc) values
of only a few tens of meV below their radiative
limit (19-23). These values surpass those reached
with crystalline silicon absorbers and are com- 
parable with solar cells based on epitaxially 
grown GaAs (23, 24). However, perovskite com- 
positions with a wider bandgap that are needed 
for high-efficiency tandem solar cells still show 
considerable Voc losses (14, 25). The main reasons 
include comparably low PLQYs of the absorber 
material itself, an unsuitable choice of selective 
contacts, and phase instabilities. Even state-of- 
the-art perovskite/silicon tandem cells still have 
Voc values well below 1.9 V. 

We present a strategy to overcome these 
issues simultaneously, demonstrated with a triple- 
cation perovskite composition with a bandgap 
of 1.68 eV, which enables photostable tandem 
devices with a Voc of 1.92 V. We note that the 
charge extraction efficiency, and hence the fill 
factor (FF), of PSCs is still poorly understood. 
Although reported PSCs usually feature a small 
active area (~0.1 cm²) with small absolute
photocurrents (a few milliamperes), and thus 
small series resistance losses at the contacts, 
typical FFs of high-efficiency devices generally 
range from 79 to 82%. However, on the basis of 
the detailed balance limit, PSCs should be able 
to deliver a FF of 90.6% at a bandgap of 1.6 eV. 
Wider-bandgap perovskite compositions near
1.7 eV seem especially prone to low FFs, resulting
in tandem cell FF values commonly below
77%, near current-matching conditions (11, 12, 26). 
In optimized perovskite single-junction devices, 
the FFs only recently exceeded 80%, with a 
maximum value of 84.8% (27). 

One reason for the low FF might be that 
there are only a few techniques for quantifying 
and analyzing the FF losses in PSCs. We show 
that intensity-dependent transient photolumi- 
nescence in combination with absolute photo- 
luminescence is a viable technique for doing so. 
A main FF limitation of high-efficiency PSCs 
is the ideality factor n_ID, with typical values of
1.4 to 1.8 for high-Voc devices (28), whereas 
established solar cell technologies reach values 
of 1 to 1.3 (29). Thus, an important goal for 
perovskite photovoltaics is to lower the ideal- 
ity factor while minimizing nonradiative inter- 
face recombination to achieve a high Voc (28). 
We designed a self-assembled monolayer (SAM) 
with methyl group substitution as a hole- 
selective layer, named Me-4PACz ([4-(3,6- 
dimethyl-9H-carbazol-9-yl)butyl]phosphonic 
acid) and show that a fast hole extraction 
went along with a lower ideality factor. Thus, 
FFs of up to 84% in p-i-n single-junction PSCs 
and >80% in tandem devices were achieved. 

The SAM provided both fast extraction and 
efficient passivation at the hole-selective inter- 
face. This combination slowed light-induced 
halide segregation of a tandem-relevant perov- 
skite composition with 1.68-eV bandgap, al- 
lowed a PLQY as high as on quartz glass, and 
led to high single-junction device Voc values of 
>1.23 V. The single-junction improvements 
transferred into tandem devices, which allowed 
us to fabricate perovskite/silicon tandem solar 
cells with a certified PCE of 29.15%. This value 
surpasses the best silicon single-junction cell 
(26.7%) and is comparable to the best GaAs 
solar cell (27) at the same area of 1 cm².
Under maximum power point (MPP) tracking in 
ambient air without encapsulation, a Me-4PACz 
tandem cell retained 95% of its initial ef- 
ficiency after 300 hours. We used injection- 
dependent absolute electroluminescence (EL) 
spectroscopy to reconstruct the individual sub- 
cell current-voltage curves without the influence 
of series resistance (pseudo-J-V curves), which 
showed that the tandem device design that 
features only a standard perovskite film without 
additional bulk passivation could in principle 
realize PCE values up to 32.4%. 


Stabilization of wide-bandgap perovskite with 
the hole-selective layer 


The ideal top cell bandgap for perovskite ab- 
sorbers in conjunction with CIGSe and Si bottom 
cells is ~1.68 eV (30-32). These wider-bandgap 
compositions often feature a Br/I ratio of 
>20%, which can lead to phase instabilities 
caused by light-induced halide segregation, most 
strikingly evident from photoluminescence (PL)
spectra that show a double-peak formation
under continuous illumination (33, 34). Upon 
generation of charge carriers in the perovskite 
film, iodide-rich clusters can form that are 
highly luminescent because they serve as 
charge carrier sinks, given their lower band- 
gap relative to the surrounding material (35). 
As quantified by Mahesh et al., although some 
portion of the Voc loss is related to halide 
segregation, the dominant source of Voc loss is 
likely the generally low optoelectronic quality of 
the Br-rich mixed-halide perovskite absorbers, 
or high nonradiative recombination rates at 
their interfaces (35). Hence, to unambiguously 
determine the limitations and potentials of 
wide-bandgap compositions, it is necessary to 
find suitable charge-selective contacts that do 
not introduce further losses or instabilities. 

We show that fast charge extraction paired 
with surface passivation can effectively sup- 
press the formation of a double-peak emission 
in the PL, indicative of phase stabilization, and 
simultaneously enable a high quasi-Fermi level 
splitting (QFLS) and device performance. Rather 
than optimizing the perovskite composition or 
passivating the film, we chose a variant of the 
widely used Cs-, FA-, and MA-containing “triple- 
cation” perovskite (36) that is highly reproducible 
(FA, formamidinium; MA, methylammonium) 
and focused on preparing an optimal charge- 
selective contact on which the perovskite 
film was deposited. We enlarged the band- 
gap by increasing the Br/I ratio to obtain a 
1.68-eV (23% Br) absorber instead of the 
commonly used 1.60 to 1.63 eV (~17% Br), 
yielding a nominal precursor composition of 
Cs₀.₀₅(FA₀.₇₇MA₀.₂₃)₀.₉₅Pb(I₀.₇₇Br₀.₂₃)₃.

A schematic of the device stack and the 
hole-selective layers (commonly abbreviated 
as HTLs, “hole-transporting layers”) used for 
PL measurements is shown in Fig. 1. We first 
compared the QFLS measured by absolute PL 
and then the PL stability of this perovskite com- 
position prepared on indium tin oxide (ITO) 
substrates covered by the HTLs. In recently 
published high-PCE p-i-n (“inverted”) single- 
junction and tandem PSCs, the polymer
poly[bis(4-phenyl)(2,4,6-trimethylphenyl)amine]
(PTAA) or the comparable
poly[N,N′-bis(4-butylphenyl)-N,N′-bis(phenyl)-benzidine] (polyTPD)
is typically used (10, 11, 37, 38). Alternatively,
SAMs based on carbazole, such as MeO-2PACz 
and 2PACz, can form passivated interfaces while 
allowing for low transport losses because they 
are ultrathin (<1 nm) (7). The introduction of 
a methyl-group substitution to the “lossless” 
hole-selective interface created by 2PACz (7) 
led to a more optimized alignment with the 
perovskite valence band edge (see energetic 
band edge diagram in fig. S1) with a similar 
dipole moment (~1.7 D) and resulted in faster 
charge extraction. The supplementary mate- 
rials contain the synthesis scheme of the SAMs 
we used. In the literature concerning the n-i-p
configuration of PSCs, methoxy substituents
are prevalent in HTLs, with some reports of a 
possible passivation function at the perovskite 
interface (39-42). For the p-i-n configuration, 
however, the standard high-performance HTLs 
PTAA and polyTPD contain alkyl substituents. 
In the present study, we directly compared 
methoxy and methyl substituents in p-i-n cells 
with MeO-2PACz and Me-4PACz, with the re- 
sults showing advantages for the methyl sub- 
stitution with respect to both passivation and 
hole extraction. We tested the influence of the 
aliphatic chain length (n) in carbazole-based
SAMs without (nPACz) and with methyl sub- 
stitution (Me-nPACz) on PSC performance and 
found an optimum FF at n = 2 for nPACz and 
n = 4 for Me-nPACz (see fig. S23). For n = 6, 
both SAMs led to current-voltage hysteresis. 

The QFLS values of bare perovskite films 
(Fig. 1B) deposited on 2PACz and Me-4PACz 
were similar to that on quartz glass, commonly 
regarded as a perfectly passivated substrate 
(16). Perovskite compositions with high Br con- 
tent typically segregate into I-rich phases indi- 
cated by increased PL intensity at lower photon 
energies, here at a wavelength of 780 nm (33). 
Pristine regions of the nonsegregated perovskite 
film emitted photons at a peak wavelength of 
~740 nm for perovskite deposited on glass 
(Fig. 1C) or ITO/PTAA (Fig. 1D), and a similar 
response was seen for the SAM MeO-2PACz 
(fig. S3) on ITO. However, the perovskite emis- 
sion was more stable over time on ITO/2PACz 
and ITO/Me-4PACz substrates (Fig. 1E and fig.
S3). The raw spectra are shown in fig. S4. 

Among the studied HTLs, phase segrega- 
tion was inhibited only if the perovskite was 
grown on a substrate that fulfilled the require- 
ments of both fast charge extraction and good 
passivation; Fig. 1F shows that passivation 
alone was insufficient. The black curve shows 
a PL spectrum of the perovskite film on an 
insulating glass substrate that was covered 
by Me-4PACz after 10 min of continuous spot 
illumination with 1-sun equivalent photon 
flux. The illuminated film showed signs of 
I-rich phases emitting at a center wavelength 
of ~780 nm. The glass substrate ensured that 
no hole transfer out of the perovskite bulk oc- 
curred. In contrast, a conductive ITO substrate 
that allowed hole transmission in combina- 
tion with Me-4PACz increased the PL stability, 
as evidenced by the sharp peak with emission 
centered at ~740 nm even after 10 min of spot 
illumination. 

A bare ITO substrate seemed to prevent 
charge accumulation as well, allowing a stable 
PL peak position at 1-sun intensity (spot size 
0.12 cm²; see fig. S6). The connection between
charge accumulation in the perovskite and 
phase instability was reported in previous 
studies in which a reduced density of carriers 
increased the activation energy of mobile ion 
species and allowed the film to remain in its
initial form (43, 44). Spot illumination (0.12 cm²
with 1-sun photon flux) represented increased 
stress testing on phase stability compared to 
full illumination because it created an outward 
driving force for ions from the illuminated 
area (45). Consequently, a smaller illumination 
spot (i.e., larger edge-to-area ratio) at the same 
illumination intensity showed a faster PL red- 
shift (see figs. S5 and S6). To compare the 
degree of PL redshift and double-peak forma- 
tion, we evaluated the ratio of the two emis- 
sion center intensities at 740 and 780 nm for 
two different excitation fluences equivalent to 
1-sun and 30-sun illumination (Fig. 1, G and 
H). At 1-sun-equivalent intensity, only 2PACz 
and Me-4PACz on ITO had a stable ratio. 
However, upon increasing the intensity and 
thus the charge carrier generation rate by a 
factor of 30, a Me-4PACz-covered ITO sub- 
strate differed from the 2PACz-covered sub- 
strate by still displaying a similarly stable PL 
intensity ratio. 
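
The figure of merit used here, the ratio of PL intensity at 780 nm (I-rich domains) to that at 740 nm (nonsegregated perovskite), is simple to compute from a spectrum. The sketch below is illustrative only: the ±5 nm averaging band, the Gaussian line shapes, and the amplitudes are assumptions, not values from the measurements.

```python
import numpy as np

def pl_band_ratio(wavelength_nm, intensity, numerator_nm=780.0,
                  denominator_nm=740.0, half_width_nm=5.0):
    """Ratio of mean PL intensity around two emission centers."""
    wavelength_nm = np.asarray(wavelength_nm, dtype=float)
    intensity = np.asarray(intensity, dtype=float)

    def band_mean(center_nm):
        # Average the spectrum in a small band to reduce sensitivity to noise.
        in_band = np.abs(wavelength_nm - center_nm) <= half_width_nm
        return intensity[in_band].mean()

    return band_mean(numerator_nm) / band_mean(denominator_nm)

def gaussian(x, center, sigma):
    return np.exp(-0.5 * ((x - center) / sigma) ** 2)

# Hypothetical spectra: a single 740-nm peak versus a double-peak spectrum.
wavelength = np.arange(650.0, 851.0, 1.0)
stable = gaussian(wavelength, 740.0, 15.0)
segregated = gaussian(wavelength, 740.0, 15.0) + 2.0 * gaussian(wavelength, 780.0, 15.0)

ratio_stable = pl_band_ratio(wavelength, stable)
ratio_segregated = pl_band_ratio(wavelength, segregated)
print(ratio_stable, ratio_segregated)
```

A ratio that stays low and constant over the illumination time indicates a phase-stable film, whereas a rising ratio signals growth of the 780-nm emission.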

We used transient photoluminescence (TrPL) 
to analyze charge carrier transfer into adjacent 
charge-selective layers (46). The full decay is 
governed by nonradiative, trap-assisted surface/ 
bulk recombination (mostly monoexponential 
decay), radiative recombination (“bimolecular,” 
second-order decay), and charge transfer ef- 
fects, which can be disentangled if these time 
constants differ sufficiently from each other (78). 
Figure 2A presents PL transients of 1.68-eV
bandgap perovskite films on ITO/HTL substrates.
With MeO-2PACz and PTAA, it was not pos- 
sible to clearly differentiate between charge 
extraction and trap-assisted recombination be- 
cause the nonradiative recombination was high 
(as evidenced by lower QFLS values relative to 
quartz glass; Fig. 1B) and because the tran- 
sients did not saturate toward one process. In 
contrast, the PL transients for 2PACz and Me- 
4PACz showed a clear monoexponential decay 
at later times, indicating Shockley-Read-Hall 
recombination (47). Fits to the TrPL transients 
(fig. S8) were used to compute the differential 
lifetime τ = -{d ln[ϕ(t)]/dt}⁻¹ (Fig. 2B), where
ϕ(t) is the time-dependent PL photon flux. In
this representation, the processes that reduce 
the PL counts over time are separable, and the 
transient decay time (or “lifetime”) is directly 
readable at each time point (46). 
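
The differential lifetime can be evaluated numerically from any transient as the negative inverse of the logarithmic derivative of the PL flux. The following sketch applies that definition to two synthetic decays. The biexponential curve is only a toy stand-in for a fast early process followed by slow first-order recombination; its amplitudes and time constants are assumed, and it is not a model of the measured transients.

```python
import numpy as np

def differential_lifetime(time_s, pl_flux):
    """Differential lifetime: -1 / (d ln(flux) / dt), evaluated at each time point."""
    time_s = np.asarray(time_s, dtype=float)
    pl_flux = np.asarray(pl_flux, dtype=float)
    dlnflux_dt = np.gradient(np.log(pl_flux), time_s)
    return -1.0 / dlnflux_dt

time_s = np.linspace(0.0, 5e-6, 501)

# Monoexponential decay: the differential lifetime is constant.
mono = np.exp(-time_s / 2e-6)
tau_mono = differential_lifetime(time_s, mono)

# Toy biexponential decay: fast early component, slow late component.
biexp = 0.7 * np.exp(-time_s / 0.1e-6) + 0.3 * np.exp(-time_s / 5e-6)
tau_biexp = differential_lifetime(time_s, biexp)
print(tau_mono[0], tau_biexp[0], tau_biexp[-1])
```

For the monoexponential case the result equals the decay constant at every time point. For the toy biexponential case the lifetime starts short and rises to a plateau set by the slow component, which is the qualitative shape discussed for Fig. 2B.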

The asymptotically reached high TrPL lifetimes
of >5 μs for both 2PACz and Me-4PACz
suggest that there were minimal nonradiative
recombination losses at the SAM interfaces. The
charge transfer process at early times (until
~1 μs) led to a sharp rise of τ, resembling
simulated curves by Krogmeier et al. (46). The 
transition from increasing lifetime to the plateau 
marks the end of charge transfer, and non- 
radiative first-order recombination becomes 
dominant. Because PLQY measurements of 
films on 2PACz and Me-4PACz indicated a 
similar level of interface recombination under
the same charge generation conditions (see
also fig. S9), the steepness of this rise was 
influenced by the charge transfer speed. The 
observed gradient for Me-4PACz implied a 
faster hole transfer to the underlying ITO 
relative to 2PACz, with the saturation starting 
after ~300 ns rather than ~1 μs.

In the charge carrier generation regime of
this experiment (~1 sun, ~3 × 10¹⁵ cm⁻³),
trap-assisted recombination dominated, with
the PL flux scaling proportionally to the density
of photogenerated carriers n, as evidenced
by intensity-dependent TrPL shown in fig. S9.
Figure S9 further demonstrates that at higher
generation conditions, the PL flux scaled proportionally
to n², where transients usually show
a multiexponential signature, as seen with
2PACz and quartz (fig. S10). Nonetheless, in this
regime the Me-4PACz transients remained
monoexponential until generation densities
exceeded ~35 suns equivalent. We interpret this
as a consequence of a large hole-extraction flux,
which causes first-order recombination to dominate
even in this injection regime.
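
As a rough plausibility check on the generation regime, an initial carrier density can be estimated from the pulse fluence quoted in the Fig. 2 caption (~30 nJ/cm²). The sketch below assumes complete, uniform absorption in the film; the 660-nm excitation wavelength and 500-nm film thickness are hypothetical placeholders, not values given in the text.

```python
PLANCK_J_S = 6.62607015e-34
LIGHT_SPEED_M_S = 2.99792458e8

def carrier_density_cm3(fluence_j_per_cm2, wavelength_nm, thickness_nm, absorbed_fraction=1.0):
    """Initial carrier density for a pulse absorbed uniformly across the film thickness."""
    photon_energy_j = PLANCK_J_S * LIGHT_SPEED_M_S / (wavelength_nm * 1e-9)
    photons_per_cm2 = absorbed_fraction * fluence_j_per_cm2 / photon_energy_j
    thickness_cm = thickness_nm * 1e-7
    return photons_per_cm2 / thickness_cm

# Fluence from the Fig. 2 caption; wavelength and thickness are assumed values.
density = carrier_density_cm3(30e-9, wavelength_nm=660.0, thickness_nm=500.0)
print(f"{density:.2e} cm^-3")
```

Under these assumptions the estimate is on the order of 10¹⁵ cm⁻³. The result scales inversely with the assumed film thickness and linearly with the absorbed fraction.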

We quantify this phenomenon of persisting
domination of first-order recombination in
Fig. 2C by displaying the ratio of higher-order
to first-order recombination for the different
generation conditions (see supplementary
text for the evaluation method). Comparison
of Me-4PACz to 2PACz indicates that the hole-extraction
flux of Me-4PACz was larger by a factor
of >10, because the curvature of the TrPL
transient only begins to resemble that of 2PACz
at a factor of >10 higher generation density
(indicated by the blue dashed line in Fig. 2C).

Fig. 1. Photoluminescence properties and stability assessment of perovskite
films on different substrates. (A) Schematic description of the photoluminescence
(PL) experiment and chemical structure of a general carbazole-based
SAM, with R denoting a substitution, which in this work is either nothing
(2PACz), a methoxy group (MeO-2PACz), or a methyl group (Me-4PACz). The
number 2 or 4 denotes the number of the linear C atoms between the
phosphonic acid anchor group and the conjugated carbazole main fragment.
(B) Quasi-Fermi level splitting (QFLS) values of nonsegregated 1.68-eV bandgap
perovskite films on a bare glass substrate and different hole-selective layers on
the transparent and conductive indium tin oxide (ITO). Error bars denote the
global error of the evaluation method (~20 meV). (C to E) Time-dependent
photoluminescence spectra analyzing phase stability of perovskite absorbers with
1.68-eV bandgap. The perovskite films were deposited either on glass (C) or on ITO
substrates with different hole-selective layers [(D) and (E)]. The color scale is at the
far right. (F) PL spectra before (dashed lines) and after 600 s of light-soaking (solid
lines) under 1-sun equivalent illumination in air, comparing the perovskite grown
on Me-4PACz that had been deposited on a glass substrate and a conductive ITO
substrate. (G) Ratio of PL intensities at 780 nm (I-rich domains) and 740 nm (neat
perovskite) from the PL evolutions in (C), (D), (E), and two other hole-selective layers
(see fig. S4; illumination spot size ~0.12 cm²), shown as a figure of merit for phase
stability. (H) Ratio of PL intensities as in (G), but at higher illumination intensity
through decrease of the excitation spot size to 0.4 mm².

The carrier mobilities determined by optical 
pump terahertz probe measurements (fig. S12) 
were similar between perovskite films grown 
on the different HTLs. To also exclude differ- 
ences in perovskite composition and crystal 
orientation due to possible growth differences, 
we probed the effect of the HTL on these 
properties by grazing-incidence wide-angle 
x-ray scattering at the four-crystal monochro- 
mator beamline of the Physikalisch-Technische 
Bundesanstalt (48). Azimuthally integrated dif- 
fraction patterns collected on a movable PILATUS 
detector module (49) showed comparable com- 
position in each case (fig. S13), with marginally 
increased PbI₂ scattering intensity on PTAA as
we observed in our previous work (7). Compar- 
ing azimuthal intensity profiles for perovskite 
scattering features (fig. S14), we found a 
negligible difference in crystallographic orien- 
tation between the samples. 

Fig. 2. Role of charge transfer in transient photoluminescence (TrPL).
(A) PL transients of perovskite on ITO/hole-selective layer substrates. The dashed
lines indicate the background levels. (B) Computed differential lifetimes from fits
to the transients in (A), showing the single-exponential decay time at each time of the
transient, with early times shown in the inset. The inset highlights the region of the
Me-4PACz and 2PACz transients that is governed by hole transfer into the ITO. Excitation
density is similar to 1-sun conditions (fluence of ~30 nJ/cm², 2 × 10¹⁵ to 3 × 10¹⁵ cm⁻³).

Our complete solar cells were capped by C₆₀
as the electron-selective contact. The electron
extraction speed did not limit the cell operation,
as demonstrated by a time-resolved terahertz
photoconductivity measurement combined with
TrPL on a quartz/perovskite/C₆₀ sample (fig.
S11). We compared the decays of free charge
carriers after interface-near carrier generation
on both sample sides and found an electron
transfer time constant of ~1 ns, substantially
faster than hole transfer at the hole-selective
layer (in the range of ~100 ns). The extraction
velocity into the C₆₀ in our model was
1.6 x 10* cm/s (see fig. S11 for details), a value 
similar to earlier reported velocities (46). 


Performance of perovskite single-junction 
solar cells 


For analysis at the solar cell level, we focused 
on the simple single-junction device stack
glass/ITO/HTL/perovskite/C₆₀/SnO₂/Ag, with the
SnO₂ serving as a buffer layer for indium
zinc oxide (IZO) sputtering in the fabrication 
of tandem solar cells (50). We found that the 
combination of fast charge extraction and 
passivated interface not only mitigated phase 
instability (see Fig. 1) but was also linked to 
an increased FF of solar cell devices, mainly by 
a decreased diode ideality factor of the PSCs. 
The FF is the major remaining parameter for 
which PSCs have not yet come close to the 
values of established solar cell technologies 
(24, 51) (see fig. S16 for FF comparisons), with 
the ideality factor being one of the main prop- 
erties that limit high-efficiency PSCs (29). 
MeO-2PACz and 2PACz led to FFs of up to 
82% (Fig. 3A), whereas with Me-4PACz the 
values were as high as 84%, representing ~93% 
of the radiative limit. 

Figure 3B shows J-V curves recorded at 
simulated AM1.5G illumination conditions, com- 
paring champion PTAA and Me-4PACz cells 
of the same batch and showing the superior 
performance of the SAM. The ideality factors 
n_ID for PSCs with different HTLs (Fig. 3C and
Table 1) were ~1.26 for Me-4PACz, ~1.42 for 
2PACz, 1.51 for MeO-2PACz, and ~1.55 for PTAA 
cells. Figure S20 compares the Voc values 
achieved with the different HTLs. Despite the 
large differences in passivation at the hole- 
selective interface, the differences in Voc were not 
as large (average difference of 30 mV between
PTAA and Me-4PACz) because of the limiting
nonradiative recombination at the C60 interface.
However, as reasoned above, the C60 layer
did not limit charge extraction, hence the dif- 
ferent extraction speeds invoked by the HTLs 
directly influenced the FF values. The high 
FF with Me-4PACz was accompanied by 
high Voc values of up to 1.16 V; when a LiF 
interlayer was placed between the perovskite 
and C60, we achieved a maximum voltage of
1.234 V (52, 53) (Fig. 3D and fig. S20). The
combination of a high Voc with low n_ID was
previously considered as challenging for PSCs 
(28), and it allowed us to fabricate a perovskite 
single junction with a PCE of 20.8% with 
Me-4PACz (fig. S18) and a perovskite band- 
gap of 1.68 eV. 
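
The ideality factors quoted above come from linear fits of Voc against the logarithm of the illumination intensity (Fig. 3C). A minimal sketch of such a fit follows, assuming the standard diode relation Voc = const + n (kT/q) ln(intensity). The function name, the thermal voltage at 298.15 K, and the synthetic data are illustrative assumptions, not the authors' analysis code.

```python
import numpy as np

# Thermal voltage kT/q in volts at 298.15 K (assumed measurement temperature).
THERMAL_VOLTAGE = 0.025693

def ideality_factor(intensity_suns, voc_volts, thermal_voltage=THERMAL_VOLTAGE):
    """Return the slope of Voc versus ln(intensity) in units of kT/q."""
    slope, _intercept = np.polyfit(np.log(intensity_suns), voc_volts, 1)
    return slope / thermal_voltage

# Synthetic suns-Voc data generated with an ideality factor of 1.26
# and a 1-sun Voc of 1.16 V (values taken from the text for illustration).
intensity = np.logspace(-2, 0, 9)
voc = 1.16 + 1.26 * THERMAL_VOLTAGE * np.log(intensity)
n_fit = ideality_factor(intensity, voc)
print(f"fitted ideality factor: {n_fit:.2f}")
```

On measured data the fit range matters, because the slope usually changes at low and high intensities.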

To investigate the FF values without the 
influence of series resistance losses, we mea- 
sured intensity-dependent absolute PL spectra 
and computed the QFLS values [or implied
Voc (iVoc)] as a function of the illumination
intensity. The derived data pairs of generation
currents and iVoc values allowed the recon-
struction of hypothetical, so-called pseudo-J-V
curves, as recently shown in (54) (Fig. 3D). The 
extracted FF and pseudo-FF values (FF in ab- 
sence of transport losses) of bare perovskite 
films grown on different HTLs are summarized 
in Table 1, row 1. Both 2PACz and Me-4PACz 
enabled high “pseudo-FF” (pFF) values of 
~88%, which is 96.8% of the detailed balance 
limit and similar to the value achieved on a 
bare quartz substrate. PTAA allowed for a 
pFF of only 85.6%. 
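
The reconstruction described above can be sketched as follows, assuming the common suns-Voc convention in which an illumination intensity I (in suns) corresponds to a generation current I × Jsc, so that the pseudo current density at voltage iVoc(I) is Jsc (1 − I). The function name and the synthetic input are hypothetical, and the procedure in (54) may differ in detail.

```python
import numpy as np

def pseudo_fill_factor(intensity_suns, ivoc_volts, jsc=1.0):
    """Build a pseudo-J-V curve from (intensity, iVoc) pairs and return its FF.

    intensity_suns must be increasing and must include 1 sun.
    """
    intensity = np.asarray(intensity_suns, dtype=float)
    voltage = np.asarray(ivoc_volts, dtype=float)
    current = jsc * (1.0 - intensity)          # shifted generation current
    voc_1sun = np.interp(1.0, intensity, voltage)
    return np.max(current * voltage) / (jsc * voc_1sun)

# Synthetic example: an ideal diode with ideality factor 1 and iVoc = 1.2 V at 1 sun.
suns = np.logspace(-4, 0, 2001)
ivoc = 1.2 + 0.025693 * np.log(suns)
pff = pseudo_fill_factor(suns, ivoc)
print(f"pseudo-FF: {100 * pff:.1f}%")
```

Because no current flows during the optical measurement, series resistance does not enter, which is why the pseudo-FF is an upper bound for the FF of the electrical J-V curve.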

This analysis highlights how the SAMs 
formed a practically lossless interface between 
ITO and perovskite. Interestingly, when includ-
ing a C60 layer on top of the perovskite film, no
Colors are as in (A). The shaded areas are a guide marking the approximate time
domain in which the Me-4PACz transient is governed by charge transfer. (C) Ratio of 
higher-order processes to monoexponential decay in the TrPL transients, revealing 
that Me-4PACz not only extracts holes faster [inset in (B)] but does so at ~10 times 
the efficiency of 2PACz, because the Me-4PACz transient shows the same 
magnitude of radiative recombination only with charge carrier generation that is 
higher by a factor of ~10 (comparison along the dashed line; see fig. S10 for details). 
Fig. 3. Performance and fill factor loss analysis of p-i-n solar cells with
different hole-selective layers. (A) Comparison of fill factor values of PSCs
with the stack glass/ITO/HTL/perovskite/C60/SnO2/Ag, triple-cation perovskite
absorber with 1.68-eV bandgap. All data are from cells made from the same
perovskite precursor and contact processing batch. The boxes indicate the
25/75 percentiles; the whiskers indicate the 10/90 percentiles. (B) J-V curves
of the best cells of the batch in (A) and a J-V curve of a Me-4PACz cell from
another batch with LiF interlayer between C60 and perovskite, reaching a Voc
of 1.234 V. (C) Intensity-dependent open-circuit voltage Voc with linear fits
(dashed lines). (D) Pseudo-J-V curves reconstructed from intensity-dependent
absolute PL measurements on the illustrated sample stack. The 2PACz and
Me-4PACz curves almost coincide; the dashed lines represent pseudo-J-V curves
from the sample variations including the electron-selective C60 layer, with which
all curves are comparable because of the limiting nonradiative recombination at the
C60 interface. (E) Pseudo-J-V curves reconstructed from the measurements in (C).
Table 1 summarizes the FF values extracted from the pseudo-J-V curves. (F) Repartition
of loss mechanisms lowering the cell’s FF below the detailed balance limit,
comparing PTAA and Me-4PACz cells: nonradiative loss in neat material (= radiative FF
limit minus pFF of neat film), nonradiative interface loss (= pFF of neat film minus pFF
of full cell), and transport loss (= pFF of full cell minus FF of measured solar cell).


Table 1. Comparison of “pseudo” fill factors (pFF) and implied open-circuit voltages (iVoc).
The values were derived from suns-PL and suns-Voc measurements for our perovskite film on all
studied hole-selective layers and on quartz glass. The table also shows the maximum FF attained in
J-V measurements (max FF) (see also Fig. 3). “Half cell” refers to substrate/HTL/absorber, whereas
“full cell” denotes the complete solar cell including C60, SnO2, and Ag metal electrode.

differences between the studied HTLs for the
iVoc and pFF were apparent (Fig. 3D, dashed
lines; Table 1, row 2), as the C60 layer sets a Voc
limitation through high nonradiative recom-
bination rates (53). This limitation was only
overcome with a counter electrode on the C60
(Fig. 3E and full devices), which underscores
the role of the dipoles that Me-4PACz and
2PACz created at the ITO surface. The calculated
molecular dipole value of the hole-transporting
fragment is ~0.2 D for MeO-2PACz, ~1.7 D for
Me-4PACz, and ~2 D for 2PACz. The positive
dipoles shifted the work function of the ITO
toward higher absolute numbers (fig. S2A),
which presumably resulted in a higher built-
in potential throughout the device (55, 56). A
well-defined built-in potential can exist with
the presence of a second electrode countering
the ITO (in this case, Ag or Cu). Thus, when
reconstructing the J-Vs from the suns-Voc mea-
surement on full devices in Fig. 3C to extract
the pFF (Table 1, row 3), both 2PACz and Me-
4PACz overcame the pFF and iVoc limitations
imposed by the C60 layer (Fig. 3E).

The differences between the electrical J-V 
curves (max. FF 84%) in Fig. 3B and pseudo- 
J-V curves (max. FF ~88%) arose from transport 
losses caused by the finite mobility of the C60,
non-optimized sample design, and ITO sheet 
resistance, as well as the measurement setup. 
Figure 3F summarizes a comparison of the 
different contributions to FF losses for PTAA 
Fig. 4. Characteristics of monolithic perovskite/silicon tandem solar cells using
various HTLs. (A) Schematic stack of the monolithic perovskite/silicon tandem solar
cell. (B) SEM image of a tandem cross section with Me-4PACz as HTL. (C) Statistics of
the PCE of PTAA, MeO-2PACz, 2PACz, and Me-4PACz tandem solar cells from J-V
scans. (D) Certified J-V curve measured at Fraunhofer ISE, including the MPP value
and the device parameters (red), in comparison to a tandem cell with PTAA (gray) as
HTL measured in-house. (E) External quantum efficiency (EQE) and reflection (denoted
as 1-R) of the certified tandem cell measured in-house. The AM1.5G-equivalent current
densities are given. (F) Long-term MPP track using a dichromatic LED illumination of
nonencapsulated solar cells in air at a controlled temperature of 25°C and relative
humidity of 30 to 40%. The data are normalized to the MPP average of the first
60 min of each individual track to account for measurement noise. Because of the fast
degradation, the MPP track of the PTAA + LiF cell is normalized to the first recorded
value. The legend specifies each HTL and notes whether a LiF interlayer was used.
Fig. 5. Luminescence subcell analysis of a tandem solar cell with Me-4PACz
and LiF interlayer. (A) Absolute PL spectra of the subcells recorded under 1-sun
equivalent illumination. The excitation wavelengths are 455 nm and 850 nm for the
perovskite and silicon subcell, respectively. PL images constructed from the integrated
PL fluxes are also shown. The edge length of the active area (inner square) is 1 cm.
(B) Reconstructed J-V curves calculated from injection-dependent electroluminescence
(EL) measurements (open symbols) and shifted by the photogenerated current
density. Furthermore, the perovskite subcell is fitted with a single-diode model (solid
brown line). The reconstructed tandem J-V (dashed line) was calculated by adding
the voltages of the subcells for each current density. The J-V measurement under
simulated 1-sun illumination of this cell is shown as a solid red line. Furthermore, a
photograph of the tandem solar cell at high injection current is shown. Due to a
bandgap of 1.68 eV, the subcell emits light in the visible wavelength range and thus,
the emission is visible by eye and with a regular digital camera.


and Me-4PACz, derived from comparisons of
the pseudo-J-V curves to the measured J-V 
curves and radiative limits, as previously re- 
ported by Stolterfoht et al. (54). In addition to 
nonradiative losses at the PTAA interface 
(red), the film thickness (~10 nm, versus <1 nm 
with a SAM) and low conductivity of the 
PTAA led to greater transport losses than 
with Me-4PACz. 


Integration into monolithic perovskite/silicon 
tandem solar cells 


Efficient passivation in combination with fast 
hole extraction of Me-4PACz in perovskite 
single junctions could be transferred into 
monolithic tandem solar cells, which led to 
higher FF, Voc, and stability. A schematic 
stack of this solar cell is shown in Fig. 4A. We 
used a silicon heterojunction solar cell as the 
bottom cell (26), based on a 260-µm-thick n-type

Al-Ashouri et al., Science 370, 1300-1309 (2020)

float-zone Si wafer processed as described
in the supplementary materials. The textured 
rear side enhanced the near-infrared (NIR) 
absorption, whereas the polished front side 
enabled the deposition of spin-coated perov- 
skite. The 20-nm ITO recombination layer also 
served as the anchoring oxide for the SAMs (7). 
The top cell, with the same 1.68-eV perovskite 
bandgap and nominal precursor composition 
Cs0.05(FA0.77MA0.23)0.95Pb(I0.77Br0.23)3 as ana-
lyzed above, formed the single-junction stack
of ITO/HTL/perovskite/(LiF)/C60/SnO2/IZO/
Ag/LiF. Figure 4B shows a scanning electron 
microscopy (SEM) cross-section image of a 
part of the tandem solar cell; no obvious dif- 
ferences were observed between perovskite 
films on the different HTLs (fig. S24). The mo- 
lecular SAM cannot be resolved with SEM. 
Figure S25 shows a photograph and layout
of the tandem device. 
Figure 4C compares the PCE of tandem
solar cells based on PTAA, MeO-2PACz, 2PACz,
and Me-4PACz, with and without a LiF inter-
layer at the perovskite/C60 interface. With PTAA,
the LiF interlayer led to rapid degradation of the 
cells (see fig. S26 for individual parameters). 
Without the interlayer, we achieved an average 
PCE of 25.25%. In contrast, the average ef- 
ficiency of MeO-2PACz and 2PACz was 26.21% 
and 26.56%, respectively. The use of a LiF 
interlayer for Me-4PACz cells increased the 
Voc but reduced the FF. Thus, both config- 
urations reached a similar average PCE of 
26.25% and 26.41%, respectively. However,
with Me-4PACz the maximum PCEs (>29%)
are higher than those of cells with 2PACz, mainly
because of higher FF of up to 81%. These
high FF values were achieved despite almost 
all cells being perovskite-limited (table S1). 
The statistics of all photovoltaic parameters 
are shown in fig. S26. The J-V measurements
of the champion cells of each configuration 
are shown in fig. S28; the PV parameters are 
summarized in table S2. 

The tandem solar cells did not reach FF 
values comparable to those in single-junction 
cells because of the larger active area (1 cm²)
and a transparent conductive oxide (TCO) 
without grid fingers, leading to increased 
series resistance. The cells showed very high 
Voc values of up to 1.92 V (fig. S30). With a 
Voc of ~715 mV from the bottom cell at half 
illumination (fig. S31), the contribution of 
the perovskite subcell was ~1.2 V. Figure 4D 
shows a direct comparison between champion 
PTAA and Me-4PACz tandem cells; besides 
the 50-mV improvement in Voc, the enhanced 
hole extraction boosted the FF by ~4% absolute. 

We sent a tandem cell with Me-4PACz and 
a LiF interlayer to Fraunhofer ISE CalLab for 
independent certification (Fig. 4D; see fig. 
S32 for certificate). With a Voc of 1.90 V, FF 
of 79.4%, and a short-circuit current density 
Jsc of 19.23 mA cm⁻², the cell had a PCE of
29.01% when measuring from Jsc to Voc, sim-
ilar to our in-house measurement (fig. S33), and
was certified at the MPP with a PCE of 29.15%
with a designated area of 1.064 cm². This PCE
surpasses other monolithic (10, 27) and four-
terminal perovskite-based tandem solar cells
(57) and is on par with the best GaAs single 
cell with the same active area (27). 
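
As a quick consistency check on the certified values, the PCE follows from Voc, Jsc, and FF through PCE = Voc × Jsc × FF / P_in. The sketch below assumes the standard AM1.5G irradiance of 100 mW cm⁻² for P_in; the function name is illustrative.

```python
def pce_percent(voc_v, jsc_ma_cm2, ff_percent, p_in_mw_cm2=100.0):
    """Power conversion efficiency in percent from Voc (V), Jsc (mA/cm^2), and FF (%)."""
    p_out = voc_v * jsc_ma_cm2 * ff_percent / 100.0   # mW/cm^2
    return 100.0 * p_out / p_in_mw_cm2

# Certified Jsc-to-Voc scan quoted in the text.
certified_scan = pce_percent(voc_v=1.90, jsc_ma_cm2=19.23, ff_percent=79.4)
print(f"PCE from scan parameters: {certified_scan:.2f}%")
```

The result reproduces the 29.01% quoted for the scan. The certified 29.15% is a separate MPP value and is not expected to follow from the rounded scan parameters.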

Figure 4E shows the external quantum ef- 
ficiency (EQE) of the certified tandem cell. 
Under AM1.5G-equivalent illumination condi- 
tions, the photogenerated current densities 
Jph in the perovskite and silicon subcells were
19.41 mA cm⁻² and 20.18 mA cm⁻², respec-
tively, which agreed with the measured Jsc of
19.23 mA cm⁻². The tandem solar cell exhibited
a nonideal current mismatch of 0.77 mA cm⁻²,
and even though the perovskite cell sets
the slope around 0 V, the cell reached a FF of
79.5%. The cumulative photogenerated cur-
rent density and loss caused by reflection were
39.59 mA cm⁻² and 2.57 mA cm⁻², respectively.
A comparison of EQEs and reflection losses 
between a cell of this work (planar front side) 
and a fully textured cell by Sahli et al. (58) is 
shown in fig. S34. 

After the certification, we fabricated more 
Me-4PACz tandem solar cells without a LiF 
interlayer (fig. S26), which showed average 
performance similar to that with LiF. The
champion cell showed a higher FF of 81%
and lower Voc of 1.87 V than with LiF.
Together with a Jsc of 19.37 mA cm⁻², this led
to a PCE of 29.29% and a stabilized efficiency 
of 29.32% (fig. S35). 

We measured the stability of different non- 
encapsulated tandem solar cells (Fig. 4F). 
To track the degradation induced by either 
the top or the bottom cell more carefully, we 
developed a dichromatic LED setup using 
LEDs with center emission wavelengths of
470 nm and 940 nm (fig. S36) and with inde-
pendent intensity calibration and recording.
We adjusted the mismatch so that the Jph in
the individual subcells was equal to that 
measured under AM1.5G-equivalent illumi- 
nation to maintain proper stability tracking 
of monolithic tandem solar cells (see below 
and supplementary text). The devices were 
measured under continuous MPP load (using 
voltage perturbation), at 25°C and in ambient 
air with 30 to 40% relative humidity. The 
photogenerated current densities of the sub-
cells are given in table S3 and determine which
subcell is limiting. A perovskite-
limited tandem cell with Me-4PACz + LiF retained
75.9% of its initial efficiency (29.13%) after
300 hours. When we substituted the Me-4PACz
with PTAA (perovskite-limited), the PCE de-
creased to 74.5% of its initial PCE (25.9%) after
only 90 hours.

We additionally tracked a cell with Me- 
4PACz as HTL without a LiF interlayer to test 
the intrinsic stability of the HTL/perovskite 
combination. After 300 hours, the cell still 
operated at 95.5% of its initial PCE. Although 
the cells were current-matched, this track moni- 
tors a degradation of the perovskite, as it directly 
translates into the performance of the tandem 
cell and no degradation of the Si subcell is 
expected within these time scales. Our com- 
parison strongly suggests that the use of a LiF 
interlayer reduces the stability. As described in 
other reports (59-62), the decrease in stability 
might be caused by deterioration of the elec-
trodes and C60 interface upon migration of Li⁺
and F⁻ ions. We note that it is important to
declare the mismatch conditions because 
the use of a NIR-poor spectrum would lead 
to a Si-limited cell and thus to a higher stabil- 
ity (see supplementary text). Comparing this 
result to state-of-the-art stability tests of non- 
encapsulated tandem solar cells in ambient 
conditions, where the cells retained 90% of 
initial PCE after 61 hours (58) and 92% after 
100 hours (13), our Me-4PACz tandem solar 
cell showed a superior operational stability. 

In addition to the long-term stability mea- 
surements at 25°C, we conducted an MPP track 
of a Me-4PACz tandem cell at elevated temper-
atures. Following the procedure of Jost et al.,
the temperature was successively increased 
from 25° to 85°C and back to 25°C (63). There 
was no loss in PCE after this 200-min proce- 
dure, despite the high MA and Br amount of the 
wide-bandgap perovskite used here (fig. S39). 


Subcell J-V characteristics of a monolithic 
tandem solar cell 


One downside of monolithic multijunction 
solar cells is that the subcell characteristics 
are barely accessible. External quantum effi-
ciency measurements are the only subcell-
resolved measurements presented in almost


11 December 2020 


all publications reporting multijunction solar 
cells. Here, we used absolute PL measurements 
in each subcell of a representative tandem 
solar cell (Me-4PACz + LiF). With this, we 
could estimate the QFLS, and thus the Voc 
was accessible for both subcells independently. 
For this, we used hyperspectral absolute PL 
imaging at equivalent 1-sun conditions with
an illumination spot larger than the area of 
the solar cells. The PL spectra and the inte- 
grated images are shown in Fig. 5A. 

From the high-energy slope of the absolute 
PL spectra of the subcells, the individual im- 
plied Voc values were calculated: 1.18 V for 
the perovskite subcell and 0.72 V for the Si 
subcell (18, 64). From the PL spectra, we cal- 
culated the PLQY of both subcells, yielding 
values of 1.5% for Si and 0.02% for the perov- 
skite. PLQY values exceeding 5% have already 
been demonstrated in perovskite single-junction 
devices for lower bandgaps (19). 
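
The PLQY values can be read as voltage losses through the standard relation ΔVoc,nonrad = (kT/q) ln(PLQY). This is a complementary estimate; the implied Voc values above were obtained from the high-energy slope of the PL spectra. The sketch assumes a temperature of 298.15 K, and the function name is illustrative.

```python
import math

THERMAL_VOLTAGE = 0.025693  # kT/q in volts at 298.15 K (assumed)

def nonradiative_voltage_loss_mv(plqy_fraction, thermal_voltage=THERMAL_VOLTAGE):
    """Voltage lost to nonradiative recombination, in mV, for a PLQY given as a fraction."""
    return -1000.0 * thermal_voltage * math.log(plqy_fraction)

loss_perovskite = nonradiative_voltage_loss_mv(0.0002)  # PLQY of 0.02%
loss_silicon = nonradiative_voltage_loss_mv(0.015)      # PLQY of 1.5%
print(f"perovskite subcell: {loss_perovskite:.0f} mV below its radiative limit")
print(f"silicon subcell:    {loss_silicon:.0f} mV below its radiative limit")
```

Each tenfold gain in PLQY recovers about 59 mV at room temperature, which puts the 5% PLQY reported for lower-bandgap devices in perspective.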

To estimate the pseudo-J-V curves of the 
subcells, we performed absolute EL imaging, 
where the excess charge carriers are generated 
electrically to access the subcell characteristics 
(65-68). For each injected current, an EL image 
was recorded, from which the voltage of the 
subcells can be calculated from an average over 
the active area (fig. S40). With the recon- 
structed pseudo-J-V curves from injection 
current-dependent EL imaging, we analyzed 
the maximum possible efficiency of this cell 
stack with minimized charge transport losses 
(see supplementary materials for more details). 
We reconstructed both subcell J-V curves by 
calculating the implied voltage at each injected 
current, yielding a “pseudo” light-J-V (JV_EL)
curve for each subcell after shifting it by the
respective photogenerated current density Jph
calculated from EQE measurements; these
Jph values amounted to 18.7 and 20.6 mA cm⁻²
for the top and bottom cell, respectively. The 
open symbols in Fig. 5B show the measured 
EL data points averaged over the perovskite 
and silicon subcell and shifted by their re-
spective Jph values.

For the perovskite, we additionally fitted 
the data with a single-diode model to display 
the J-V curve over the whole voltage range, 
which was otherwise not accessible during the 
EL measurement. To obtain the tandem JV_EL,
we added the voltages of the subcells for each 
current density. The dashed line shows the 
result. The reconstructed curve deviated from 
the electrically measured J-V curve under a 
solar simulator. This is mainly because EL 
gave access only to the internal voltage, whereas 
an electrical J-V curve displays the current 
density versus external voltage (which is af- 
fected by series resistances; see supplementary 
text). Hence, a high FF (87.8%) of the JV_EL
can be regarded as the maximum achievable 
value for this particular tandem cell if the 
electrodes and all charge-selective layers were 
free of series resistance losses. This would give
a PCE of 31.7%, surpassing the theoretical PCE 
maximum of a silicon single cell (29.4%) (69). 
Thus, this cell stack has the capacity to over- 
come the 30% barrier through technical opti- 
mization of the contacts alone. However, by 
adjusting the mismatch conditions, even higher 
efficiencies are achievable. 
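
The step of adding the subcell voltages at each current density can be illustrated with two ideal single-diode subcells in series. This is a sketch under stated assumptions: the Jph values and implied Voc values are those quoted in the text, whereas the ideality factors (1.3 and 1.0), the thermal voltage, and all names are placeholders, and series resistance is neglected.

```python
import numpy as np

VT = 0.025693  # thermal voltage kT/q in volts near 25 °C (assumed)

def subcell_voltage(j, jph, voc, n):
    """Internal voltage of an ideal single-diode subcell at current density j.

    The saturation current is chosen so that the subcell reaches voc at j = 0.
    """
    j0 = jph / np.expm1(voc / (n * VT))
    return n * VT * np.log1p((jph - j) / j0)

JPH_PERO, JPH_SI = 18.7, 20.6  # mA cm^-2, from the EQE values quoted in the text
j = np.linspace(0.0, JPH_PERO, 2000, endpoint=False)

v_pero = subcell_voltage(j, JPH_PERO, voc=1.18, n=1.3)  # n is a placeholder
v_si = subcell_voltage(j, JPH_SI, voc=0.72, n=1.0)      # n is a placeholder

v_tandem = v_pero + v_si      # series connection: same current, voltages add
p_max = np.max(j * v_tandem)  # mW cm^-2
ff_tandem = p_max / (JPH_PERO * v_tandem[0])
print(f"Voc = {v_tandem[0]:.2f} V, FF = {100 * ff_tandem:.1f}%")
```

The numbers printed by this toy model are not the paper's results; the point is only that the tandem curve inherits its current limit from the subcell with the smaller Jph.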

To find the requirements for the highest 
efficiency, we fit the silicon subcell with a 
single-diode model. We conducted SPICE 
(Simulation Program with Integrated Cir- 
cuit Emphasis) simulations to sweep the 
photogenerated current densities in the sub-
cells. The single-diode models of the silicon
and perovskite subcells were connected in 
series (schematically shown in fig. S42A), and 
the cumulative current density was fixed to 
39.3 mA cm⁻² (as calculated from EQE mea-
surements for AM1.5G-equivalent illumina- 
tion). Figure S42B shows the photovoltaic 
parameters as a function of the mismatch 
(Jph,Si − Jph,pero). As shown in a previous pub-
lication, the Voc is almost independent of the
mismatch, whereas the FF is affected by it
(26). A minimum FF occurs when the Jph,Si is
0.7 mA cm⁻² below the Jph,pero. However,
simultaneously the Jsc is highest under this 
condition. In a current-matching situation, 
the highest efficiency is 32.43%. This sets an 
estimation for the efficiency potential upon 
reduction of all transport losses. Decreasing 
the limitations set by the hole extraction speed, 
as shown in this work, is a viable path for 
exploring this potential. 
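
The mismatch sweep can be mimicked without SPICE by placing two ideal single-diode subcells in series and redistributing a fixed cumulative current of 39.3 mA cm⁻² between them. The following is a simplified sketch, not the authors' simulation: the ideality factors and saturation currents are placeholders anchored to the implied Voc values quoted earlier, and resistive effects are ignored, so the position of the FF minimum will not match the reported one.

```python
import numpy as np

VT = 0.025693   # thermal voltage kT/q in volts near 25 °C (assumed)
J_TOTAL = 39.3  # cumulative photogenerated current density, mA cm^-2
N_PERO, N_SI = 1.3, 1.0  # placeholder ideality factors

def j0_from_voc(jph, voc, n):
    """Saturation current density that gives voc at photocurrent jph."""
    return jph / np.expm1(voc / (n * VT))

J0_PERO = j0_from_voc(18.7, 1.18, N_PERO)
J0_SI = j0_from_voc(20.6, 0.72, N_SI)

def tandem_parameters(mismatch):
    """Return (Jsc, Voc, FF, PCE in %) for mismatch = Jph,Si - Jph,pero."""
    jph_pero = (J_TOTAL - mismatch) / 2.0
    jph_si = (J_TOTAL + mismatch) / 2.0
    jsc = min(jph_pero, jph_si)  # the limiting subcell sets the current
    j = np.linspace(0.0, jsc, 4000, endpoint=False)
    v = (N_PERO * VT * np.log1p((jph_pero - j) / J0_PERO)
         + N_SI * VT * np.log1p((jph_si - j) / J0_SI))
    p_max = np.max(j * v)        # mW cm^-2, equal to PCE in % at 100 mW cm^-2
    return jsc, v[0], p_max / (jsc * v[0]), p_max

sweep = {m: tandem_parameters(m) for m in (-2.0, -1.0, 0.0, 1.0, 2.0)}
for m, (jsc, voc, ff, pce) in sweep.items():
    print(f"mismatch {m:+.1f}: Jsc {jsc:.2f}, Voc {voc:.3f} V, FF {100 * ff:.1f}%, PCE {pce:.2f}%")
```

Even this idealized model shows the trade-off described above: Jsc peaks at current matching while the FF is lowest there, and Voc barely moves.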
Optically addressable molecular spins for quantum
information processing


S. L. Bayliss, D. W. Laorenza, P. J. Mintun, B. D. Kovos, D. E. Freedman, D. D. Awschalom


Spin-bearing molecules are promising building blocks for quantum technologies as they can be 
chemically tuned, assembled into scalable arrays, and readily incorporated into diverse device 
architectures. In molecular systems, optically addressing ground-state spins would enable a wide range 
of applications in quantum information science, as has been demonstrated for solid-state defects. 
However, this important functionality has remained elusive for molecules. Here, we demonstrate such 
optical addressability in a series of synthesized organometallic, chromium(IV) molecules. These 
compounds display a ground-state spin that can be initialized and read out using light and coherently 
manipulated with microwaves. In addition, through atomistic modification of the molecular structure, we 
vary the spin and optical properties of these compounds, indicating promise for designer quantum
systems synthesized from the bottom-up.


Optically addressable solid-state spins
(1-4) are an important platform for 
quantum information science, with im- 
pressive demonstrations ranging from 
quantum teleportation (5) to the map- 
ping of individual nuclear spins (6). The optical- 
spin interface of these solid-state systems is 
crucial for a diverse range of applications, from 
nanoscale sensing to long-distance quantum 
communication, as it enables straightforward 
single-spin readout and initialization. How- 
ever, for this family of qubits, synthetic tuna- 
bility of optical and spin properties, deterministic 
fabrication of multiqubit arrays, or translation 
of spin centers between different host mate- 
rials and devices remain outstanding goals. 
By contrast, chemical synthesis of molecular 
spin systems affords bottom-up qubit design 
(7, 8). A chemical approach offers tunability 
through atomistic control over the qubit; scal- 
ability through chemical assembly of extended 
structures; and portability across different envi- 
ronments (e.g., solution, surface, solid-state), 
because the qubit is not confined to a specific 
host. These capabilities provide substantial 
control over the intrinsic and extrinsic envi- 
ronment of molecular qubits. Notably, with 
chemical synthesis, nuclear spins can be con- 
trollably placed around a molecular qubit (9), 
arrays of spins can be created in one-, two- and 
three-dimensional architectures (10, 11), and 
molecular spins can be integrated into elec-
tronic and photonic devices (12, 13). Molecular
systems have shown impressive capabilities, 
including long spin coherence times (14, 15), 
manipulation of photoexcited triplet states 
(16-18), and quantum optics with spin-singlet 
(S = 0) organic molecules (19). However, in
contrast to spins in semiconductors, the ground- 
state spins of molecular systems have lacked 
an optical-spin interface for both qubit ini- 
tialization and readout (20). Creating such 
an interface in a molecular platform would 
generate a class of qubits that can be engi- 
neered with atomic precision, with potentially 
transformative applications for bottom-up 
quantum technologies ranging from quan- 
tum sensors to hybrid quantum systems. 

Here, we synthesize a series of molecular 
qubits with such an optically addressable 
ground-state spin. We show that these molec- 
ular spin qubits can be initialized and read 
out with light, and coherently manipulated 
with microwave fields. Notably, chemical mod- 
ification of these qubits modulates the result- 
ing optical-spin interface, illustrating the 
potential of bottom-up qubit creation for op- 
tically addressable spin systems. 

To achieve the desired optical addressabil- 
ity, we target a molecular system consisting 
of a metal ion bonded to organic moieties 
(ligands), comprising a portable qubit of ~1-nm 
size. This metal-organic motif provides a well- 
defined qubit through the electronic spin of 
the central metal ion, and the surrounding 
ligands enable synthetic modification of the 
optical and spin properties. 

The key requirements for such an optically 
addressable molecular spin qubit are (i) a 
ground-state spin that can be coherently ma- 
nipulated and (ii) a spin-selective optical pro- 
cess to initialize and read out the spin. To 
achieve these functionalities, we selected a
chromium ion (Cr⁴⁺) coordinated by strong-
field (aryl) ligands in a high-symmetry config-
uration, which gives rise to the energy-level
structure shown in Fig. 1A. The d² electronic
configuration of Cr⁴⁺ in a pseudo-tetrahedral
environment produces a spin-triplet (S = 1) 
ground state with a small ground-state zero- 
field splitting, characterized by the parameters 
D and E (Fig. 1A and fig. S1), allowing for spin 
manipulation at readily available microwave 
frequencies. 
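
For an S = 1 ground state, the zero-field splitting is conventionally written as H = D[Sz² − S(S+1)/3] + E(Sx² − Sy²). The sketch below diagonalizes this textbook Hamiltonian to show that the three spin-resonance frequencies are D − E, D + E, and 2E. The D and E values are illustrative placeholders within the <5 GHz range, not the fitted parameters of any compound.

```python
import numpy as np

def spin1_zero_field_levels(d_ghz, e_ghz):
    """Eigenenergies (GHz, ascending) of the S = 1 zero-field-splitting Hamiltonian."""
    s = 1.0 / np.sqrt(2.0)
    sx = s * np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=complex)
    sy = s * np.array([[0, -1j, 0], [1j, 0, -1j], [0, 1j, 0]], dtype=complex)
    sz = np.diag([1.0, 0.0, -1.0]).astype(complex)
    # S(S + 1) / 3 = 2/3 for S = 1
    h = d_ghz * (sz @ sz - (2.0 / 3.0) * np.eye(3)) + e_ghz * (sx @ sx - sy @ sy)
    return np.linalg.eigvalsh(h)

# Placeholder parameters (GHz), chosen only for illustration.
levels = spin1_zero_field_levels(d_ghz=3.0, e_ghz=0.5)
transitions = sorted(abs(a - b) for i, a in enumerate(levels) for b in levels[i + 1:])
print("levels (GHz):", np.round(levels, 3))
print("transition frequencies (GHz):", np.round(transitions, 3))
```

With D = 3 GHz and E = 0.5 GHz, the levels sit at −2D/3 and D/3 ± E, so all transitions fall at standard microwave frequencies.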

A strong ligand-field environment ensures 
that the lowest-lying electronic excited state is 
a spin-singlet (S = 0) (21). This configuration
leads to narrow optical transitions between 
the S = 1 ground state and the S = 0 excited 
state, which, when combined with the ground- 
state zero-field splitting, enables optical spin 
readout and initialization (i.e., spin polariza- 
tion) through spin-selective resonant excita- 
tion. First, optical readout of the ground-state 
spin is possible because a probed spin sub-
level (e.g., |0⟩ in Fig. 1A) will give rise to more
photoluminescence (PL) than the other spin
sublevels (e.g., |±1⟩ in Fig. 1A). Second, optical
polarization of the ground-state spin results 
when selective excitation, combined with spon- 
taneous emission, transfers population from 
the probed to the other spin sublevels (22). This 
is referred to as optical pumping or hole burn- 
ing. To accumulate spin polarization over 
multiple excitation and emission cycles, the 
ground-state spin-lattice relaxation time (T1)
must be much longer than the excited-state 
lifetime (Topt). These components are the key 
ingredients that we use to obtain the desired 
optical-spin interface. 
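
Why the spin-lattice relaxation time must greatly exceed the optical lifetime can be seen in a toy rate-equation model. Everything in this sketch is an assumption made for illustration: one probed ground sublevel is pumped at a fixed rate, the excited state decays with equal branching into the three ground sublevels, and spin-lattice relaxation mixes the ground sublevels at a rate set by T1. It is not the model used in the source.

```python
import numpy as np

def steady_state_populations(pump_rate, tau_opt, t1):
    """Steady-state populations of a toy 4-level model (all rates in 1/s).

    States: 0 = probed ground sublevel, 1 and 2 = other ground sublevels,
    3 = excited state.
    """
    m = np.zeros((4, 4))  # m[i, j] is the rate from state j into state i
    m[3, 0] = pump_rate   # resonant excitation of the probed sublevel only
    for g in range(3):
        m[g, 3] = 1.0 / (3.0 * tau_opt)     # emission, equal branching (assumed)
    for a in range(3):
        for b in range(3):
            if a != b:
                m[a, b] = 1.0 / (3.0 * t1)  # spin-lattice relaxation
    m -= np.diag(m.sum(axis=0))             # diagonal holds minus the total outflow

    a_mat = m.copy()
    a_mat[-1, :] = 1.0   # replace one rate equation by the normalization condition
    rhs = np.zeros(4)
    rhs[-1] = 1.0
    return np.linalg.solve(a_mat, rhs)

# Placeholder numbers: 5 µs optical lifetime, 1e5 excitations per second.
p_long = steady_state_populations(pump_rate=1e5, tau_opt=5e-6, t1=1e-3)
p_short = steady_state_populations(pump_rate=1e5, tau_opt=5e-6, t1=1e-6)
print(f"probed-sublevel population, T1 = 1 ms: {p_long[0]:.4f}")
print(f"probed-sublevel population, T1 = 1 µs: {p_short[0]:.4f}")
```

In this model the probed sublevel is almost emptied when T1 is long, whereas a T1 shorter than the optical lifetime refills it nearly as fast as it is pumped and little polarization builds up.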

With these criteria in mind, we synthesized 
three Cr⁴⁺ compounds (Fig. 1B), which differ
by the placement of a single CH3 (methyl
group) on the coordinating ligands, through
solution-phase chemistry. In brief, we react the
appropriate aryl lithium species with Cr³⁺Cl3
(THF)3 at −78°C, which undergoes a dispro-
portionation or auto-oxidation to yield the
corresponding tetrahedral Cr⁴⁺R4 [R = o-tolyl,
2,3-dimethylphenyl, 2,4-dimethylphenyl; see
(23) for further details] (24). We diluted each
compound in its S = 0, isostructural tin an-
alog to form dilute molecular crystals (1, 2,
3), illustrated in Fig. 1C, thus reducing inter-
actions between Cr⁴⁺ centers. The metal-center
symmetry in these crystals varies from rela- 
tively high, S, for 1, to low, C, for 3, directly 
affecting the ground-state spin structure (23). 
All experiments were performed on 1-3 in an 
optical cryostat with microwave access [≈4 to
5 K at the sample mount, Fig. 1C and (23)] 
unless stated otherwise. 

Under off-resonant excitation (785 nm), 
ground-state population is promoted to the 
first S = 1 excited state, undergoes fast inter-
system crossing to the S = 0 state, and decays to
Fig. 1. Generating an optical interface for ground-state molecular spin qubits.
(A) Energy-level diagram of Cr⁴⁺ in 1-3 depicting photoluminescence (PL) from
the S = 0 state. (B) Molecular structures for 1-3 determined by single-crystal x-ray diffraction.
Hydrogen atoms are omitted for clarity. Ligand modifications for 2 and 3 are highlighted in
green and blue. Chromium and carbon atoms are shown in purple and gray, respectively.
(C) Experimental schematic depicting optical excitation and PL collection for spin initialization
and readout. Each Cr⁴⁺ compound is diluted in a single crystal (purple) of the isostructural
S = 0 tin (Sn) analog. An illustrative structure is shown. A microwave field (B1) from a waveguide (gold) is used for spin manipulation, and a static field (B0) enables Zeeman splitting.
(D) PL spectra for 1-3 at 4 K using off-resonant (785 nm) excitation. (E) Zeeman splitting of the zero-phonon line of 1 at 9 T. (F) Optical lifetimes for 1-3 measured
using resonant excitation at the zero-phonon line. (G) X-band continuous-wave electron spin resonance (cw-ESR) spectra for 1-3 collected at 77 K. Simulations
are shown in black, along with extracted D and E parameters. The central resonances at g = 2 to 2.1 are discussed in (23).


the S = 1 ground state, emitting near-infrared 
PL. For 1-3, this emission comprises sharp 
zero-phonon lines (ZPLs) ranging from 1009 
to 1025 nm (Fig. 1D), along with longer- 
wavelength phonon sidebands. The minor 
ligand modifications in 1-3 also result in 
distinctive ground-state spin structures, as 
observed in ground-state electron spin reso- 
nance (ESR) measurements (Fig. 1G), with 
extracted values of D and E lying in the readily 
addressable regime of <5 GHz for each com- 
pound [because the signs of D and E are not 
determined and only their magnitudes influ- 
ence our experiments, we take D, E > O for 
clarity (23)]. These features, along with optical 
lifetimes (3.3 to 6.9 us, Fig. IF) that are much 
shorter than 7; [see below and (23)], therefore 
suggest that 1-3 satisfy the above criteria for 
optically addressable molecular qubits while 
exhibiting variable optical and spin properties. 

To further confirm the level structure in Fig. 
1A, we measure the emission of 1 under a high 
magnetic field using off-resonant excitation 
(Fig. 1E). Owing to the S = 0 excited state, the
Zeeman splitting of the ground state manifests
directly as a shift in the optical emission energies.
This effect is clearly shown by taking
the difference in PL spectra at 9 and 0 T: Optical
emission into the |±1⟩ spin sublevels shifts
to lower and higher energies, giving characteristic
peaks on either side of the zero-field ZPL in
the differential spectrum (Diff. PL in Fig. 1E),
along with a central dip [the feature at 1030 nm
arises from the vibrational sideband (23)].

Fig. 2. All-optical ground-state spin initialization and readout of 1. (A) Energy-level structure showing
optical spin initialization through spin-selective excitation. (B) Photoluminescence excitation (PLE) spectrum
obtained by sweeping a narrow-line laser over the zero-phonon line. The dashed line shows the excitation
wavelength used for all following experiments. (Inset) Dependence of the PL on laser polarization, defined by
the angle θ from the crystal long axis. (C) Phonon sideband under resonant and off-resonant excitation
showing emission line narrowing. (Inset) Schematic of subensemble excitation. (D) Time-resolved optical spin
initialization. (E) All-optical measurement of the spin-lattice relaxation time (T1).

Fig. 3. Optically detected magnetic resonance (ODMR) and coherent spin
manipulation of the ground state of 1. (A) ODMR as a function of magnetic field
and microwave frequency using continuous-wave optical excitation. Dashed lines are
a simulation with the stated g and D values. (B) Pulsed ODMR and (C) Hahn-echo

To demonstrate an optical-spin interface in
these systems, we focus on 1 as an illustrative
example before discussing 2 and 3. Using a
narrow-line laser, we resonantly excite the S =
1 ground state to the S = 0 excited state (Fig.
2A) and collect emission into the phonon sideband
to remove excitation laser scatter. First,
we characterize the emission as a function
of the excitation wavelength (Fig. 2B), showing
a ZPL at 1025 nm. We excite at this ZPL
maximum (dashed line in Fig. 2B) for all following
experiments. To further maximize emission,
we align the excitation polarization with
the optical dipole transition, which is collinear 
with the long axis of the needle-like crystal 
(Fig. 2B, inset). The optical inhomogeneous 
linewidth of ~150 GHz shown in Fig. 2B is 
>>D and thus appears prohibitive for spin- 
selective excitation; however, resonant excita- 
tion addresses only a subensemble of molecules 
from the inhomogeneous distribution. Whereas 
the inhomogeneous linewidth is broadened by 
static energy-level variations, likely caused by 
strain, the subensemble linewidth is limited 
only by dynamical processes (e.g., electron- 
phonon coupling) (25). To demonstrate that 
the subensemble linewidth is indeed much 
narrower than the inhomogeneous linewidth, 
we compare the phonon sidebands under re- 
sonant excitation and off-resonant excitation 
(Fig. 2C). The emission line narrowing (26) 
under resonant excitation indicates that the 
ensemble ZPL indeed consists of narrower 
subensembles, which we use for all following 
spin-selective experiments. 

We next measure all-optical initialization 
and readout of the ground-state spin using 
hole burning and recovery. To initialize the 
spin, we apply the pulse sequence outlined 
in Fig. 2D, consisting of a long optical pulse 
(2 ms) followed by a wait time (2 ms) to 
equilibrate ground-state spin populations be- 
fore the next pulse. The emission during the 
optical pulse shows the characteristic behavior 
of optical spin polarization: a gradual drop in 
emission as population is pumped from the 
probed ground-state spin sublevel (the “bright” 
state) and into the other (“dark”) spin sub- 
levels. The optical contrast between the start 
and the end of the pulse places a lower bound 
on the spin polarization of 14% (23). 

Using this spin initialization, we measure
the ground-state spin-lattice relaxation time,
T1, by performing the two-pulse experiment
outlined in Fig. 2E. This sequence consists
of an initialization pulse (300 μs), a variable
relaxation time, and a readout pulse (20 μs).
The initialization pulse transfers population
to the “dark” spin sublevels. As ground-state
spin population relaxes back to the “bright”
sublevel, the emission increases. Measuring
this emission at variable relaxation times
yields T1 = 0.22(1) ms. That T1 is much
longer than the optical lifetime (τopt = 3.3 μs,
Fig. 1F) confirms that many optical cycles
can be used to accumulate ground-state spin 
polarization. 
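
As a rough illustration of the comparison made above, the ratio of T1 to the optical lifetime gives an order-of-magnitude count of the optical lifetimes that fit within one spin-lattice relaxation time. The sketch below uses only the values quoted in the text for 1; the single-exponential recovery model and all function names are assumptions made for illustration and are not the analysis used in the study.

```python
import math

T1_US = 220.0      # spin-lattice relaxation time of 1 (0.22 ms, from the text)
TAU_OPT_US = 3.3   # optical lifetime of 1 (from the text)


def lifetimes_per_t1(t1_us: float, tau_opt_us: float) -> float:
    """Number of optical lifetimes that fit in one T1 (rough cycle budget)."""
    return t1_us / tau_opt_us


def bright_recovery(delay_us: float, t1_us: float, depletion: float = 1.0) -> float:
    """Assumed single-exponential recovery of the normalized bright-state
    emission after the initialization pulse (hypothetical model)."""
    return 1.0 - depletion * math.exp(-delay_us / t1_us)


if __name__ == "__main__":
    print(f"optical lifetimes per T1: {lifetimes_per_t1(T1_US, TAU_OPT_US):.0f}")
    for delay in (0.0, 100.0, 220.0, 1000.0):
        print(delay, round(bright_recovery(delay, T1_US), 3))
```

With these numbers the ratio is several tens, which is the sense in which "many" optical cycles are available before the spin polarization relaxes.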

We next manipulate the ground-state spin 
of 1 using a microwave field. First, using 
continuous-wave (cw) optical excitation, we 
place a subensemble of spins in the “dark” 
state and monitor changes in emission (ΔPL) 
as we sweep the microwave frequency. When 
this microwave frequency matches the spin 
sublevel splitting, the “dark” and “bright” sub- 
levels are mixed, resulting in increased PL. 
Figure 3A shows this optically detected mag- 
netic resonance (ODMR) as a function of both 
the microwave frequency and an external mag- 
netic field applied along the long axis of the 
crystal. The zero-field cw-ODMR spectrum 
provides D = 3.63 GHz, and the Zeeman split- 
ting yields a g-factor of 2.0, in agreement with 
the ESR measurements (Fig. 1G). 
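
The dependence of the ODMR resonances on D, E, g, and field can be sketched with the standard S = 1 spin Hamiltonian, H/h = D(Sz² − 2/3) + E(Sx² − Sy²) + (gμB/h)B·Sz. This is a minimal sketch, assuming the static field lies along the principal (z) axis of the zero-field-splitting tensor, which the text does not state explicitly; the function name is hypothetical and the code is not the simulation used for Fig. 3A.

```python
import numpy as np

# Bohr magneton divided by the Planck constant, in GHz per tesla
MU_B_OVER_H_GHZ_PER_T = 13.996245


def triplet_transitions_ghz(d_ghz: float, e_ghz: float, g: float, b_tesla: float):
    """Transition frequencies (GHz) from the lowest S = 1 sublevel to the
    two upper sublevels, for a field along z (assumption)."""
    sz = np.diag([1.0, 0.0, -1.0])
    # Matrix of Sx^2 - Sy^2 in the |+1>, |0>, |-1> basis
    sx2_minus_sy2 = np.array([[0.0, 0.0, 1.0],
                              [0.0, 0.0, 0.0],
                              [1.0, 0.0, 0.0]])
    h = (d_ghz * (sz @ sz - (2.0 / 3.0) * np.eye(3))
         + e_ghz * sx2_minus_sy2
         + g * MU_B_OVER_H_GHZ_PER_T * b_tesla * sz)
    levels = np.linalg.eigvalsh(h)  # ascending eigenvalues
    return levels[1] - levels[0], levels[2] - levels[0]


if __name__ == "__main__":
    # Values quoted in the text for 1: D = 3.63 GHz, g = 2.0; E set to 0 here
    for b_mt in (0.0, 2.0, 10.0):
        low, high = triplet_transitions_ghz(3.63, 0.0, 2.0, b_mt * 1e-3)
        print(f"B0 = {b_mt:4.1f} mT: {low:.3f} GHz, {high:.3f} GHz")
```

At zero field and E = 0 both transitions coincide at D, and a field along z splits them linearly by ±(gμB/h)B0. A nonzero E, as reported for 2 and 3, instead splits the zero-field resonance into D ± E.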

To demonstrate coherent control over the 
ground-state spin, we drive Rabi oscillations 
(Fig. 3D) using the pulsed ODMR sequence 
outlined in Fig. 3B. This sequence consists of 
an optical initialization pulse, a wait time, a 
variable length microwave pulse, and an op- 
tical readout pulse. The inset shows the ex- 
pected square-root dependence of the Rabi 
frequency on the applied microwave power. 
Next, using a π pulse calibrated from Fig. 3D,
we perform pulsed ODMR at a fixed magnetic
field, B0 = 10 mT, while varying the microwave

sequences. (D) Rabi oscillations between the |0⟩ and |−1⟩ spin sublevels (B0 = 10 mT).
(Inset) Microwave-power dependence of the Rabi oscillation frequency. (E) Pulsed
ODMR spectrum (B0 = 10 mT) and double Lorentzian fit (black). (F) Optically detected
ground-state spin coherence (B0 = 2 mT) with exponential fit (black).


frequency (Fig. 3E). Finally, by replacing the 
single microwave pulse in Fig. 3B with a Hahn- 
echo sequence (Fig. 3C), we measure the spin
coherence time T2 = 640(60) ns (Fig. 3F, B0 =
2 mT). The final π/2 pulse in the sequence
projects the coherences onto spin populations
for optical readout. In these pulsed ODMR
experiments, the wait time (10 μs ≈ 3τopt)
between initialization and microwave mani- 
pulation ensures population is in the ground 
state prior to coherent control. This wait time, 
along with the above agreement between the 
ODMR and ESR spin parameters, shows that 
we coherently control the ground-state spin. 
Furthermore, the measured T2, likely limited
by the surrounding hydrogen nuclear spins, is
comparable to other transition-metal-based
molecular qubits in nuclear spin-rich environments
(15, 27), and we expect that ligand
deuteration should greatly enhance T2 (27, 28).
Thus, with 1, we demonstrate optical initializa- 
tion, microwave coherent control, and optical 
readout of the ground-state spin in an en- 
semble of molecular qubits. 
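
The role of the wait time can be made concrete with a short estimate. Assuming the optically excited population decays as a single exponential with the optical lifetime quoted for 1 (an assumption made here for illustration; the function name is hypothetical), the fraction still in the excited state after the wait is:

```python
import math


def residual_excited_fraction(wait_us: float, tau_opt_us: float) -> float:
    """Fraction of excited-state population remaining after a wait,
    assuming single-exponential decay with lifetime tau_opt."""
    return math.exp(-wait_us / tau_opt_us)


if __name__ == "__main__":
    # 10 us wait and 3.3 us optical lifetime, both taken from the text
    frac = residual_excited_fraction(10.0, 3.3)
    print(f"residual excited-state fraction: {frac:.3f}")
```

Under this assumption only a few percent of the excited population remains when the microwave pulses begin, consistent with the statement that the coherent control acts on the ground state.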

Having demonstrated an optical-spin inter- 
face and coherent spin control for 1, we high- 
light how this functionality is not specific to 
a single compound. In Fig. 4, we show optical 
initialization, microwave spin manipulation, 
and optical readout of 2 and 3 through cw- 
ODMR. As captured by the simulations, the 
variable peak intensities arise from ESR se- 
lection rules (23). Notably, the additional ligand 
methyl groups in 2 and 3 lead to substantial 
E by lowering the crystal symmetry, relative 
to 1, indicating that noise-insensitive (i.e., 
clock-like) transitions could be used to en- 
hance spin coherence in these compounds 
(29). These results also suggest that modifica- 
tion of the host material provides a further 
opportunity to manipulate ground-state spin 
structure. Compounds 1 to 3 illustrate that
minor synthetic modifications provide variable
magnetic and electronic structures, indicating
potential for designing compounds with multiple,
separately addressable qubit sites.

Fig. 4. Optical spin addressability with synthetic tunability. (A to C) cw-ODMR spectra and simulations
(black) for 1-3, with microwave transitions and ligand modifications depicted.

Although small molecular ensembles may 
find applications for quantum sensing, exten- 
sion of these results to the single-molecule 
level will be important for their implementa- 
tion as qubits. We anticipate single-molecule 
sensitivity could be achieved by reducing ra- 
diative lifetimes through coupling to a photo- 
nic cavity and enhancing photon collection 
efficiency. Furthermore, we expect that initial- 
ization and readout fidelities could be im- 
proved by enhancing the spin selectivity of the 
optical excitation. Such enhancement could 
be achieved by engineering compounds with 
larger zero-field splittings, or narrower sub- 
ensemble optical linewidths (23). 

We envisage several promising avenues for 
optically addressable molecular spin systems, 
which may be more challenging to achieve 
with solid-state defects. First, multimetallic 
systems could be synthesized with interacting 
metal sites that exhibit distinct optical-spin 
interfaces, enabling both individual qubit ad- 
dressability and multiqubit operations. Tar- 
geted nuclear spin placement around these 
metal centers could further provide long-lived 
quantum memories (30). Alternatively, ligand 
functionalization and extension of these results
to biocompatible metal ions (31) may
enable sensing of targeted analytes. Furthermore,
self-assembly could facilitate integration
with photonic or electronic devices (32).
Finally, the highly tunable nature of mole- 
cules offers promise for rationally designed 
properties tailored to a specific purpose, e.g., 
telecom emission for long-distance quantum 
networks, or strong spin-phonon interactions 
for hybrid quantum systems. These results 
highlight pathways to design and create quantum
technologies from the bottom up.

Reversible planar gliding and microcracking in
a single-crystalline Ni-rich cathode


Yujing Bi, Jinhui Tao, Yuqin Wu, Linze Li, Yaobin Xu, Enyuan Hu, Bingbin Wu, Jiangtao Hu,
Chongmin Wang, Ji-Guang Zhang, Yue Qi*, Jie Xiao*


High-energy nickel (Ni)-rich cathode will play a key role in advanced lithium (Li)-ion batteries, but it suffers 
from moisture sensitivity, side reactions, and gas generation. Single-crystalline Ni-rich cathode has a great 
potential to address the challenges present in its polycrystalline counterpart by reducing phase boundaries 
and materials surfaces. However, synthesis of high-performance single-crystalline Ni-rich cathode is very 
challenging, notwithstanding a fundamental linkage between overpotential, microstructure, and 
electrochemical behaviors in single-crystalline Ni-rich cathodes. We observe reversible planar gliding and 
microcracking along the (003) plane in a single-crystalline Ni-rich cathode. The reversible formation of 
microstructure defects is correlated with the localized stresses induced by a concentration gradient of Li atoms 
in the lattice, providing clues to mitigate particle fracture from synthesis modifications. 


For long-range electric vehicles, next-generation
Li-ion batteries that use stable
high-energy cathode materials are needed
(1). Among different cathodes, Ni-rich
NMC (LiNixMnyCo1−x−yO2, x > 0.6), with
>200 mA-hour/g capacity, high voltage (>3.8 V),
and low cost, is considered one of the
most promising candidates (2). Traditional
LiNi1/3Mn1/3Co1/3O2 cathodes are prepared
using a coprecipitation method that aggregates
nano-sized primary particles into micron-sized
secondary particles. The agglomerated
polycrystalline NMC have shortened diffusion
lengths within their primary particles and an
increased number of pores, which accelerates
Li⁺ transport. This has thus become the most
popular morphology for conventional NMC
particles (3).

Creating spherical secondary polycrystalline 
NMC particles reduces the surface/volume 
ratio, but pulverization along the weak internal
grain boundaries is generally observed
after cycling (4). These cracks are induced by
the nonuniform volume change of primary 
particles during cycling and exacerbated by 
the anisotropy among individual particles and 
grains in the polycrystalline NMC (5). The 
intergranular cracking exposes new surfaces 
to electrolyte for side reactions, which accelerates
cell degradation (6). As the Ni content
increases to more than 0.6 (e.g., LiNi0.8Mn0.1Co0.1O2),
usable capacity is further increased,
but new challenges such as moisture sensitivity
(7), aggressive side reactions, and gas
generation (8) during cycling are found, all of
which initiate from particle surfaces.

A feasible approach for stabilizing Ni-rich 
NMC is to synthesize single-crystalline cath- 
odes with reduced surface areas, phase bound- 
aries, and more integrated crystal structures, 
similar to LiCoO, (9). Single-crystalline NMC 
with Ni < 0.6 that shows good performance 
has been demonstrated (10-17). However, 
the capacity of those single-crystalline NMC 
(Ni < 0.6) is relatively low, owing to the limited 
Ni content. Single-crystalline Ni-rich cath- 
odes with Ni > 0.6 substantially increase the 
capacity but are rarely reported (18, 19)
because of the challenges of preparing high-performance
Ni-rich NMC single crystals. Owing
to the low structural stability of Ni-rich cathodes at
high temperatures, higher-Ni-content cathodes
require lower synthesis temperatures, in contrast
to the high-temperature and time-consuming
calcination process required to grow single
crystals. Synthesis of stoichiometric single crystals
with competitive electrochemical properties
is a challenge (20).

This work uses high-performance single-
crystalline LiNi0.76Mn0.14Co0.1O2 (NMC76) as 
a model material to study how the potential 
triggers the structural changes of single crys- 
tals from the atomic to micron scale and its 
implications for the electrochemical proper- 
ties of cathodes. In situ atomic force micros- 
copy (AFM) and theoretical modeling are used 
to understand the coupled electro-chemo- 
mechanical behaviors of single crystals. 

Synthesized NMC76 has an average particle
size of 3 μm (Fig. 1A). A cross-sectional view
(Fig. 1B) shows that NMC76 has a dense
structure without cavities or grain boundaries.
Pure phases of α-NaFeO2-type layered
structures are confirmed by both selected
area electron diffraction (SAED; Fig. 1C) and
x-ray diffraction (XRD; Fig. 1D). Lattice parameters
a and c are 2.8756(1) and 14.2221(1) Å,
respectively, from Rietveld refinement (see details
in table S1). For comparison, polycrystalline
NMC76 is found to contain many internal pores
and intergranular boundaries along with surface
films (fig. S1) formed from the reactions
between NMC and air (7). By contrast, the
surface of single-crystalline NMC76 is uniform
(Fig. 1, E and F). Elemental mapping (Fig. 1, G
and H) indicates a homogeneous distribution
of Ni, Mn, and Co with a stoichiometric ratio
as designed (fig. S2 and table S2). Continuous
phase transitions occur as the potential changes
(fig. S3), similar to polycrystalline NMC76.

Fig. 1. Characterization of single-crystalline LiNi0.76Mn0.14Co0.1O2. (A) SEM image of single-crystalline
NMC76. (B) Cross-sectional image of single-crystalline NMC76. (C) Electron diffraction pattern of a selected
area of single-crystalline NMC76. (D) Synchrotron XRD and Rietveld refinement pattern. (E) High-resolution
high-angle annular dark-field (HAADF)-STEM image of single-crystalline NMC76 [corresponding to the region
enclosed by the blue square in (B)]. (F) Higher-magnification view corresponding to the region enclosed by
the red square in (E). (G) Energy dispersive spectroscopy (EDS) elemental mapping of Ni, Mn, Co, and O.
(H) EDS overlapped image and line scanning shows the elemental distribution intensity (inset).

Single-crystalline NMC76 is further tested in 
graphite/NMC full cells at realistic conditions. 
The typical loading of NMC76 cathodes is about
20 mg/cm² (4 mA-hour/cm²) with about 32%
porosity, which is needed to build a 250 W-hour/kg
Li-ion cell (see table S3). Harlow et al. presented
a wide range of testing results of Li-ion
batteries based on graphite/LiNi0.5Mn0.3Co0.2O2
chemistry (16), which can serve as benchmarks
for academic research. The electrode properties
in terms of loading and so on in (16) are similar
to ours because both are derived from realistic
pouch cell parameters. At such a high cathode
loading, Li metal will worsen the cycling stability
(figs. S4 and S5) owing to the deepened
stripping and deposition process of Li (21).
Between 2.7 and 4.2 V (versus graphite), single- 
crystalline NMC76 delivers 182.3 mA-hour/g 
discharge capacity at 0.1 C and retains 86.5% of
its original capacity after 200 cycles (Fig. 2A1).
With a cutoff of 4.3 V, single-crystalline NMC76
delivers 193.4 mA-hour/g capacity with 81.6%
capacity retention after 200 cycles (Fig. 2A2).
Upon further increasing to 4.4 V, a 196.8 mA-hour/g
discharge capacity is seen (Fig. 2A3)
along with a 72.0% capacity retention after
200 cycles. Note that 200 cycles at a C/10 
charge rate and a C/3 discharge rate mean 
2600 hours of cycling. The total testing time 
is equal to a cell undergoing 1300 cycles at 1C. 
To evaluate the electrochemical properties of 
materials, it is important to use a relatively 
slow rate rather than a very fast rate, which 
produces seemingly “longer” cycles but may 
“hide” critical electrochemical information. 
It is the total time of charge-discharge that 
matters, not cycling number (22). Increased 
polarization (Fig. 2, B1 to B3) is observed 
when the cutoff voltage increases, which is 
presumably assigned to intensified electro- 
lyte decomposition at increased voltages (figs. 
S6 to S8) and thus higher impedance resulting 
from cathode passivation films and single-crystal 
lattice change. Table S4 summarizes the electro- 
chemical performances and testing conditions of 
all previously published single-crystalline Ni-rich 
NMC (Ni > 0.6) cathode materials. 
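
The cycling-time arithmetic above can be checked with a few lines. This is a sketch under simplifying assumptions that are not stated in the text: a full charge or discharge at a rate of C/n is taken to last n hours (no rest or constant-voltage steps), and the areal capacity is computed with an assumed specific capacity of 200 mA-hour/g, the value implied by the quoted loading and areal capacity. The function names are hypothetical.

```python
def total_cycling_hours(cycles: int, charge_hours: float, discharge_hours: float) -> float:
    """Total test time if each cycle is one full charge plus one full discharge."""
    return cycles * (charge_hours + discharge_hours)


def equivalent_cycles(total_hours: float, hours_per_cycle: float) -> float:
    """Number of cycles at another rate that would take the same total time."""
    return total_hours / hours_per_cycle


def areal_capacity_mah_per_cm2(loading_mg_per_cm2: float,
                               specific_capacity_mah_per_g: float) -> float:
    """Areal capacity from mass loading and specific capacity."""
    return loading_mg_per_cm2 / 1000.0 * specific_capacity_mah_per_g


if __name__ == "__main__":
    # C/10 charge (10 h) and C/3 discharge (3 h), 200 cycles
    hours = total_cycling_hours(200, 10.0, 3.0)
    print("total hours:", hours)
    # 1C charge + 1C discharge takes 2 h per cycle
    print("equivalent 1C cycles:", equivalent_cycles(hours, 2.0))
    print("areal capacity (mA-hour/cm2):", areal_capacity_mah_per_cm2(20.0, 200.0))
    # Capacity remaining after 200 cycles for each cutoff voltage in the text
    for cutoff, cap, retention in ((4.2, 182.3, 0.865), (4.3, 193.4, 0.816), (4.4, 196.8, 0.720)):
        print(cutoff, "V:", round(cap * retention, 1), "mA-hour/g retained")
```

The first two results reproduce the 2600 hours and 1300 equivalent 1C cycles quoted in the text. The last loop shows that the higher initial capacity at 4.4 V is more than offset by its lower retention after 200 cycles.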

Fig. 2. Electrochemical performance of single-crystalline LiNi0.76Mn0.14Co0.1O2 and SEM images after
cycling stability tests. (A1 to A3) Cycling stability of single-crystalline NMC76 in full cells within different
electrochemical windows. The top grayish lines and solid black lines represent coulombic efficiencies and
discharge capacities at different cycles of the cells, respectively. The dashed partial lines indicate the
capacities at the 200th cycles, with capacity retention shown underneath the dashed lines. (B1 to B3)
The corresponding charge-discharge curves of cells in (A1) to (A3). (C1 to C3) SEM images of a single crystal
after 200 cycles in (A1) to (A3). The orange arrows indicate gliding marks or microcracks found in cycled
single crystals. mAh, milliampere hour.

Lattice gliding is observed in single-crystalline
NMC76 at high voltages. Between 2.7 and 4.2 V
(versus graphite), the entire single crystal is
well maintained after 200 cycles (Fig. 2C1).
Upon increasing the cutoff voltage to 4.3 V,
there are some gliding lines seen on the crystal
surfaces after 200 cycles (Fig. 2C2). Cycled
to 4.4 V, single crystals appeared to be “sliced”
(Fig. 2C3 and fig. S9) in parallel, along the
(003) plane and perpendicular to the c axis of the
layered structure (Fig. 3C), which indicates a
mode II-type crack (in-plane shear) in fracture
mechanics. Additionally, small cracks that
indicate a mode I-type fracture (opening) are
also observed at 4.4 V (Fig. 2C3). All characterizations
have been done by selecting various
regions of NMC76 electrodes, and the same 
phenomenon is repeatedly found (figs. S10 to 
S12). Although single-crystalline NMC76 as 
a whole particle is still intact (Fig. 3A), gliding 
is the major mechanical degradation mode, 
especially when the cutoff voltage is above 
4.3 V. Of note, the “gliding steps” formed in 
cycled crystals are quite different from crack- 
ing along intergranular boundaries of poly- 
crystalline NMC particles (23). The scanning 
transmission electron microscopy (STEM) 
image for single-crystalline NMC76 (Fig. 3, 
B to D) confirms that on both sides of a 
gliding plane (yellow line in Fig. 3D), the d- 
spacing of the (003) plane (0.48 nm) is un- 
changed and the layered structure is well 
maintained after the gliding marks occur. 
The long-range lattice symmetry of the bulk 
material will therefore not be altered (fig. S13). 
Ni, Mn, Co, and O are still uniformly distrib- 
uted in the vicinity of glided planes based on 
electron energy-loss spectroscopy (EELS) anal- 
ysis (Fig. 3F and fig. S14). The uniform element 
distribution and intimately attached lattices 
across the gliding planes demonstrate that 
although planar gliding occurs, no new bound- 
ary is generated, and the sliced area maintains 
the same lattice structure and chemical conditions
as in the bulk phase. The gliding lines
(or slicing marks) cannot be observed on
the cross section of bulk particles by SEM and
are only visible by bright-field STEM on thinly
lattice symmetry is well maintained after the 
gliding, the repeated gliding near the crystal 
surface eventually will evolve into microcracks, 
exposing new surfaces to the electrolyte (Fig. 2). 

To further induce lattice gliding, the cutoff
voltage of the NMC76 single crystal is increased
to 4.8 V (versus Li⁺/Li). Slicing marks
and microcracks are present in almost every 
charged single crystal (fig. S15A). A slight 
deformation of individual single crystals is 
observed (Fig. 3G), probably because the 
gliding of each layer is equally likely to move 
toward symmetrically equivalent directions 
(24). Surprisingly, after discharging back to 
2.7 V, the majority of single crystals revert to 
their original morphologies and the previously 
observed steps and microcracks disappear (fig. 
S15B). The glided layers within single crystals 
almost completely glided back to their original 
locations (Fig. 3H), fully recovering from the 
deformation (Fig. 3G), although some “traces” 
are visible (shown in Fig. 3H). Within the “regular”
electrochemical window of 2.7 to 4.4 V
(versus graphite), after extensive cycling, lattice
gliding and microcracking are also seen
within the crystal lattice in the charged state (Fig.
3I). STEM analysis of the NMC76 crystal (Fig.
3J) indicates that the microcracks initiate from
inside of the crystal. In the discharged state of
those cycled crystals (cut off at 4.4 V), few
ridges or cracks are found on the crystals.
Although not as visible as in charged crystals,
STEM still uncovers some slicing marks (Fig.
3K) on discharged single-crystalline NMC76,
which probably undergoes a reversible sliding
process back and forth during 120 cycles. No 
microcracking is identified in those “self- 
healed” single crystals (Fig. 3L), suggesting 
that the lattice gliding and cracking in some 
of the crystals are still reversible after 120 
cycles. As cycling continues, particle deformation
will become dominant. Dislocation is
also observed near the tip regions of the
microcrack of single crystals charged at 4.4 V
(fig. S16). The accumulation of dislocation is
accompanied by the microcrack propagation
(25). A trace amount of nano-sized NiO-like
rock salt phase (fig. S17) is observed on the
gliding exposure step area of single-crystalline
NMC76 after cycling.

Fig. 3. Morphology and structure study of single-crystalline NMC76.
(A) SEM image of single-crystalline NMC76 after 200 cycles. (B) Cross-sectional
bright-field STEM image of single-crystalline NMC76 after 200 cycles.
(C) Bright-field STEM image of internal slicing. (D) HAADF-STEM
image from around the slicing area. The upper inset is a magnified image of
the gliding area enclosed by the red square. (E) SAED of the gliding area.
(F) EELS mapping of the region enclosed by the dashed rectangle in
(B). (G) SEM image of single-crystalline NMC76 initially charged to 4.8 V
(versus Li⁺/Li). (H) SEM image of single-crystalline NMC76 discharged to
2.7 V (after being charged to 4.8 V versus Li⁺/Li). The region enclosed
by the red rectangle shows that the planes glided back with the traces seen.
(I and J) STEM images of single-crystalline NMC76 at 4.4-V charge
status (cycled in a full cell between 2.7 and 4.4 V for 120 cycles).
(K and L) STEM images of single-crystalline NMC76 at discharge status
(cycled in a full cell between 2.7 and 4.4 V for 120 cycles). The red arrows
indicate the gliding marks inside the single crystal.

In situ AFM has been used to image the crystal
surface in real time in an electrochemical
cell. A ~3-μm-sized NMC76 single crystal is
studied by in situ AFM during charge and 
discharge (Fig. 4). Regions B and C in Fig. 4A 
are enlarged in Fig. 4, B and C, respectively, to 
probe the origin and evolution of gliding steps 
and microcracks under the electrical field. The 
formation of nanosized crack domains is ob- 
served on the side surface from the open- 
circuit voltage (OCV) to 4.50 V (versus Li⁺/Li) 
during charge, whereas these domains dis- 
appear in the discharge process (Fig. 4B). 
Moreover, planar gliding is characterized 
by the appearance of wide crystal steps on 
the side surface owing to the uneven move- 
ment between neighboring layers during po- 
larization. Wider gliding steps are observed on 
the side surface starting at a 4.20 V charging 
process and lead to more and wider (~85 nm) 
gliding steps at 4.50 V (Fig. 4C). When the cell 
potential decreases to 4.19 V, a few wide gliding 
steps decrease in their width (fig. S18), indicat- 
ing that the atomic layer recovers back to its 
original position (Fig. 4C). In situ AFM show- 
ing the reversible and continuous morpholog- 
ical changes during charge and discharge is 
shown in movie S1. The reversible gliding 
process is further illustrated in Fig. 4F. The 
observed lattice gliding is a direct observa- 
tion of the lattice-invariant shear (LIS) (24). 
LIS should exist in many layered electrode 
materials, which experience stacking-sequence- 
change phase transformations owing to Li 
concentration change. It was also predicted 
that LIS will lead to particle deformation and 
ridges on the particle surface, but these 
signals are likely to be buried in the internal 
boundaries in a spherical secondary polycrys- 
talline NMC particle. The micron-sized single 
crystal provides a clear platform to observe 
gliding or LIS-induced mechanical degradation. 

Electrochemical potential difference is the 
driving force of Li-ion diffusion and the for- 
mation of the Li concentration gradient. Stress 
will be generated during Li⁺ diffusion after 
establishing a Li concentration gradient in the 
lattice. An analytical cylindrical isotropic diffusion- 
induced stress model is applied to determine 
the stress generation when Li ions diffuse 
along the radial direction in the particle (fig. 
S19). The peak tensile stress along the tangen- 
tial and axial directions occurs near the sur- 
face at the onset of delithiation during charge 
(fig. S20). Conversely, the peak tensile stress 
in all three directions occurs at the center of 
the particle (fig. S21) during lithiation. During 
charge (delithiation), the tensile stress along 
the axial and tangential directions is local- 
ized on the surfaces of single crystals, lead- 
ing to microcrack opening normal to the (003) 
planes. 
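
The sign pattern described above can be reproduced with the textbook thermal-stress analogy for a long solid cylinder with a purely radial concentration profile. The sketch below is illustrative only and is not the model used in the study: it assumes a constant Young's modulus (whereas the text notes that the modulus depends on Li concentration), traction-free surfaces, and made-up parabolic concentration profiles. Stresses are returned in units of αE/(1 − ν) times the concentration unit, and all names are hypothetical.

```python
import numpy as np


def cylinder_stresses(r: np.ndarray, c: np.ndarray):
    """Radial, tangential, and axial diffusion-induced stress for a solid
    cylinder, in units of alpha*E/(1 - nu). Requires r[0] == 0."""
    f = c * r
    # Cumulative trapezoid integral of c(r') r' dr' from 0 to r
    cum = np.concatenate(([0.0], np.cumsum(0.5 * (f[1:] + f[:-1]) * np.diff(r))))
    radius = r[-1]
    mean_term = cum[-1] / radius**2
    local = np.empty_like(r)
    local[1:] = cum[1:] / r[1:] ** 2
    local[0] = c[0] / 2.0  # limit of (1/r^2) * integral as r -> 0
    sigma_r = mean_term - local
    sigma_t = mean_term + local - c
    sigma_z = 2.0 * mean_term - c
    return sigma_r, sigma_t, sigma_z


if __name__ == "__main__":
    r = np.linspace(0.0, 1.0, 201)
    # Assumed profiles: surface depleted on charge, surface enriched on discharge
    delithiation = 1.0 - 0.5 * r**2
    lithiation = 0.5 + 0.5 * r**2
    for name, profile in (("delithiation", delithiation), ("lithiation", lithiation)):
        s_r, s_t, s_z = cylinder_stresses(r, profile)
        print(name, "surface (t, z):", round(s_t[-1], 3), round(s_z[-1], 3),
              "center (r, t, z):", round(s_r[0], 3), round(s_t[0], 3), round(s_z[0], 3))
```

With a positive expansion coefficient, the depleted surface during delithiation is under tangential and axial tension, whereas during lithiation all three components are tensile at the center, matching the qualitative picture in the text.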

Fig. 4. Surface structure and morphology evolution by in situ AFM and mechanical analysis for
single-crystalline NMC76. (A) AFM image at the OCV state. (B and C) Comparison of selected surface
evolution during in situ AFM testing of regions B and C in (A), respectively. (D) COMSOL-simulated shear
stress along the yz direction during charge (delithiation) at scaled time of 0.1T. (E) COMSOL-simulated shear
stress along the yz direction during discharge (lithiation) at scaled time of 0.1T. (F) Schematic illustration
of the structural evolution of single-crystalline NMC76 upon cycling.

Local stress also has a shear component
along other directions, which is solved numerically
via COMSOL. The shear stress component
along the yz direction that can trigger
the gliding along the (003) planes is shown 
in Fig. 4, D and E. Although the signs of the 
shear stress during lithiation and delithiation 
are opposite, which explains the reversible 
gliding, the absolute values are not the same 
(fig. S22), because the elastic modulus is a 
function of Li concentration (26). Therefore, 
the gliding motion should be largely, but not 
completely, reversible. The irreversible part of
the gliding can generate small amounts of damage
that accumulate into crack openings over a long
cycling time, analogous to fatigue crack nucleation.
These lead to the ridges and microcracks
seen on the surfaces of single crystals
after cycling. 

The simple isotropic diffusion-induced-stress 
model can be used to predict if the cracks can 
be stabilized inside of the single crystal. Be- 
cause the strain energy inside the particle 
reaches a maximum around the scaled time 
of T, = 0.1T during delithiation (fig. $20), its 
comparison with the fracture energy (2y) is 
used as a criterion to evaluate the critical size 
of single-crystalline NMC76 (26). If the ac- 
cumulated strain energy is not large enough 
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to entirely cleave the crystal, the microcrack 
will be stabilized inside of the particle. 





[Eq. 1 is not recoverable from the extracted text] 

where h is the height of the cylindrical particle, 
α is the concentration expansion coefficient, 
E0 is Young’s modulus of the 
nonlithiated particle, E is Young’s modulus at 
a given Li-ion concentration, Cp is the Li-ion 
concentration at the surface, C0 is the Li-ion 
concentration at the center, ν is Poisson’s 
ratio, ξ represents the dimensionless stress 
(figs. S19 to S21), and γ is the surface energy; 
these values are listed in the supplementary 
materials. A lower-bound estimation of the 
critical size of the single crystal is predicted to 
be ~3.5 μm, below which cracks can be considered 
stable inside of the particle. The simulation 
result suggests that although fractures 
along the (003) direction appear in single crystals 
during cycling, the cracks are stable once 
formed and will not initiate catastrophic 
reactions to produce a fracture zone that 
eventually pulverizes the entire single crystal. 
Increasing the applied current density will 
lead to a higher concentration gradient and 
higher stress generation. Increasing the cutoff 
voltage is equivalent to increasing (Cp - C0) 
in Eq. 1. This means higher stress generation 
and larger strain energies at increased voltages, 
which cause more gliding and cracking 
(Fig. 2). Our findings provide some strategies to 
stabilize single-crystalline Ni-rich NMC by 
either reducing the crystal size to below 3.5 μm, 
absorbing accumulated strain energy through 
modification of the structure symmetry, or sim- 
ply optimizing the depth of charge without 
sacrificing much reversible capacity. 

Protein condensates as aging Maxwell fluids 


Louise Jawerth, Elisabeth Fischer-Friedrich, Suropriya Saha, Jie Wang, Titus Franzmann, 
Xiaojie Zhang, Jenny Sachweh, Martine Ruer, Mahdiye Ijavi, Shambaditya Saha, 
Julia Mahamid, Anthony A. Hyman, Frank Jülicher 


Protein condensates are complex fluids that can change their material properties with time. However, an 
appropriate rheological description of these fluids remains missing. We characterize the time-dependent 
material properties of in vitro protein condensates using laser tweezer-based active and microbead-based 
passive rheology. For different proteins, the condensates behave at all ages as viscoelastic Maxwell fluids. 
Their viscosity strongly increases with age while their elastic modulus varies weakly. No significant 
differences in structure were seen by electron microscopy at early and late ages. We conclude that protein 
condensates can be soft glassy materials that we call Maxwell glasses with age-dependent material 
properties. We discuss possible advantages of glassy behavior for modulation of cellular biochemistry. 


Membraneless compartments in cells 
exhibit various material properties 
adapted to their biological purpose. 
Some compartments such as P-granules 
and stress granules are liquid-like, as 
shown by fusion and rapid molecular rearrangements 
(1, 2). Others such as centrosomes 
are less dynamic and can exhibit properties of 
amorphous gels (3, 4). Dynamic solids such as 
microtubules also exist (5). The study of rhe- 
ological properties of biological compartments 
provides important information about the 
nature of molecular interactions and about 
the emergent, collective behaviors of the con- 
stituent molecules. 

Under certain conditions, a protein solution 
in vitro can phase-separate to form protein- 
dense droplets (6, 7). Such droplets can behave 
as liquids. However, in many cases, the dynamics 
of these liquid-like materials slow over 
time and the materials become more solid-like. This has 
been observed in vitro for a variety of evolutionarily 
distinct proteins involved in different 
biological functions (2, 3, 6-10) as well as 
in vivo (4, 9, 11-14). Such behavior has been 
termed hardening or aging (2, 10). 

We can distinguish several ways in which a 
liquid could become more solid-like (15-17). 
One way is gelation through the introduction 
of cross-links. When the cross-links reach a crit- 
ical density, a connected polymer network is 
formed that extends through the whole system, 
and there is a sudden transition from a vis- 
coelastic fluid to a solid (17, 18). Another way 
for a liquid to become more solid-like is found 
in glass-forming systems, such as soft pastes 
and entangled polymer systems, which do not 
reach thermodynamic equilibrium (6, 19-21). 
Glassy systems can be distinguished from gela- 
tion by the fact that there is no sudden change 
of material properties with age (18, 22). Gels 
and glasses are typically amorphous. Liquids 
can also become more solid-like by the for- 
mation of fibers or crystal-like aggregates such 
as amyloids (2, 23). Membraneless compart- 
ments in cells have been called condensates 
in part to emphasize their physical nature. How- 
ever, in general, we do not understand their 
physical properties. Probing material proper- 
ties of condensates requires understanding of 
the time-dependent rheology. 

We began by characterizing the change in 
protein diffusion in condensates as a function 
of their age tw (defined as time after forma- 
tion of droplets), using fluorescence recovery 
after photobleaching (FRAP) (Fig. 1A and movie 
S1). Diffusion of nonbleached protein into the 
bleached region is a measure of molecular mo- 
bility. Figure 1B shows typical recovery curves 
at early and late age for the C. elegans protein 
PGL-3 (fig. S1C) (24). At early age (below 
30 min), the fluorescence of labeled PGL-3 
recovers with a half-time of ~1 min. At 
age tw = 46 hours, recovery is slow, with a 
half-time of ~50 min (Fig. 1B). A second measure 
of dynamics is the coalescence time of 
two spherical condensates. Two condensates 
with liquid-like behavior merge into a final 
spherical condensate over a time that depends 
on the ratio of surface tension and viscosity 
(Fig. 1C) (1). After formation, coalescence of 
PGL-3 droplets occurs within 10 s, whereas at 
an age tw = 46 hours, it takes tens of minutes 
(Fig. 1D and movie S2). Similar behaviors 
were seen for four other proteins of the FUS 
family [FUS, EWSR1, DAZAP1, and TAF15 (10)] 
(Fig. 1, A and C; fig. S1, A and C; and Table 1). 
These results are similar to previously pub- 
lished data and demonstrate the aging behav- 
ior of a wide variety of protein condensates 
(2, 3, 6, 7, 9, 25). 
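
The dependence of coalescence time on the ratio of viscosity to surface tension can be illustrated with an order-of-magnitude estimate. The sketch below assumes the simple capillary scaling t ≈ ηR/γ with the prefactor of order unity dropped; the droplet radius, the late-age viscosity of order 10^3 Pa·s, and the use of the early surface tension at late age are illustrative assumptions, not values fitted by the authors.

```python
def coalescence_time(viscosity_pa_s, radius_m, surface_tension_n_per_m):
    """Order-of-magnitude capillary relaxation time, eta * R / gamma, in seconds."""
    return viscosity_pa_s * radius_m / surface_tension_n_per_m

RADIUS_M = 5e-6            # assumed droplet radius, comparable to the 5 um scale bars
SURFACE_TENSION = 4.5e-6   # N/m, early-droplet value quoted in the Fig. 2 caption
ETA_EARLY = 1.0            # Pa*s, early PGL-3 viscosity estimate quoted in the text
ETA_LATE = 1.0e3           # Pa*s, assumed late-age viscosity (order of magnitude only)

t_early = coalescence_time(ETA_EARLY, RADIUS_M, SURFACE_TENSION)
# Assumption: surface tension is held fixed; only the viscosity changes with age.
t_late = coalescence_time(ETA_LATE, RADIUS_M, SURFACE_TENSION)

print(f"early droplets: ~{t_early:.1f} s")
print(f"late droplets:  ~{t_late / 60:.0f} min")
```

Under these assumptions the estimate gives about a second for early droplets and tens of minutes for late droplets, the same orders of magnitude as the coalescence times reported above.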

To characterize the mechanism by which 
the material properties of condensates change 
between early and late times after their for- 
mation, we used both active (26) and passive 
microrheology. For active microrheology, con- 
densates were deformed using two laser tweez- 
ers that acted on two beads linked to a droplet 
through attached protein (Fig. 1, E and F). 
Sinusoidal forces with a given frequency were 
exerted on one bead and the bead displace- 
ments were recorded, which served as a mea- 
sure of droplet deformation (Fig. 2A). At an 
early age, the force-displacement curve is an 
ellipse showing a strong viscous component. 
At a later age, the force-displacement curve 
is almost a straight line indicative of predomi- 
nantly elastic behavior (Fig. 1G). For passive 
microrheology, we followed the movement 
of fluorescent polystyrene beads immersed 
in droplets using a confocal microscope (Fig. 
1H and movie S3) (27, 28). We noticed that 
droplets decreased in size as they aged (Fig. 
1H). Furthermore, at early age, the diffusive 
bead motion reached larger distances than 
at later ages within the same time 
window (Fig. 1I). Despite the changes in material 
properties and droplet size for PGL-3 and 
FUS, cryo-electron tomography showed no de- 
tectable changes in structural features inside 
the drops, and the material appeared amor- 
phous at all times. Occasional fibrous structures 
existed at the surface of late FUS droplets but 
not in the bulk (fig. S1B). 

To further investigate the material proper- 
ties, we measured the frequency-dependent 
complex shear modulus G* = G' + iG" as a func- 
tion of the droplet age tw after droplet formation 
using active microrheology (26). The storage 
modulus G' characterizes the elastic response 
of the droplets, and the loss modulus G" characterizes 
the viscous component of the response. 
For PGL-3, at early ages and for low 
frequencies ω, the loss modulus is larger than 
the storage modulus, indicating predominantly 
liquid-like behavior (Fig. 2B, blue symbols). 
At high frequencies, the storage modulus surpasses 
the loss modulus, indicating that at these 
frequencies, the mechanical response is predominantly 
elastic. The crossover between 
predominantly elastic and predominantly 
viscous behavior occurs at the angular frequency 
ωc. At this frequency, the storage 
modulus G' equals the loss modulus G". The 
crossover frequency ωc decreased as the sample 
aged (Fig. 2C, red symbols). The frequency-dependent 
responses obtained at early and 
late waiting times were different. However, 
they are both well described by a Maxwell 
fluid, albeit with different characteristic relaxation 
times τc (Fig. 2, B and C, gray lines). For 
a Maxwell fluid, the crossover frequency is 
ωc = 1/τc, where τc is the Maxwell relaxation 
time. The time scale τc characterizes the droplet 
response to a deformation: For deformations 
on time scales shorter than τc, the droplet 
response is primarily elastic, whereas for deformations 
on time scales longer than τc, the 
response is primarily viscous and the droplet 
will flow. Therefore, our active microrheol- 


Fig. 1. Protein droplets exhibit age-dependent 
material properties. We compare the 
properties of droplets shortly after formation 
(tw = 0 min, early droplets) with droplets 
observed a time tw of several hours after 
formation (late droplets). (A) FRAP of protein 
droplets composed of PGL-3-GFP and 
FUS-GFP, respectively. Shown are images of 
early and late droplets immediately after 
bleaching (Δt = 0 min) as well as after 
a time Δt after bleaching. (B) Fluorescence 
intensity of the photobleached area for 
the PGL-3-GFP droplets shown in (A) for 
both early (blue) and late (red) droplets. 
(C) Images of FUS-GFP and PGL-3-GFP 
droplets undergoing coalescence. Shown are 
images at the time of contact (Δt = 0) 
as well as at a time Δt after coalescence. 
(D) Aspect ratio of the coalescing PGL-3-GFP 
droplets shown in (C) as a function of 
time for early (blue) and late (red) droplets. 
The aspect ratio is defined as the ratio of the 
major to minor axes of a representative 
ellipse. (E) Schematic representation of the dual 
optical trap active microrheology experiment. 
Polystyrene beads are shown in yellow, and the 
droplet is shown in blue. One trap is fixed, and 
one trap oscillates. (F) Confocal image of a 
PGL-3-GFP droplet with two 2-μm-diameter 
beads in a dual optical trap. (G) Lissajous plots of 
force versus trap displacement for an early 
(blue) PGL-3 droplet, indicating viscous behavior, 
and for a late (red) droplet, indicating 
elastic behavior. (H) Confocal images of the 
midsection of a large PGL-3 droplet containing 
1-μm-diameter fluorescent polystyrene 
beads early (blue) and late (red) after droplet 
formation. (I) Two examples of trajectories over 
3 hours of embedded beads in a PGL-3 
droplet, early (blue, tw = 3 hours) and late 
(red, tw = 36 hours) after droplet formation 
(length of inset box represents 0.5 μm). Scale 
bars in all panels represent 5 μm. 

ogy data show that as the system ages, the 
time scale τc increases, indicating that the droplet 
exhibits elastic behavior over an increasingly 
broad range of frequencies, corresponding 
to droplet aging. The scaling of the frequency-dependent 
complex moduli is revealed by 
plotting the data with respect to a rescaled 
frequency axis. The data for different ages of 
PGL-3 droplets collapse on a single curve, which 
matches the behavior of a simple Maxwell fluid 
closely (Fig. 2D). We also performed active 
microrheology on protein condensates composed 
of FUS protein and again found rheology 
consistent with a Maxwell fluid. We 
repeated our measurements on PGL-3 under 
buffer conditions with varying salt concen- 
trations (fig. S3, A to C) and with different 
temperatures (fig. S3, F to H, and supple- 
mentary materials). These data reveal that 
with increased temperature or increased salt 
concentration, droplet aging slows and drop- 
lets appear liquid-like for a longer period of 
time. At a late age, an additional short-time 
viscosity λ can be identified, which becomes 
relevant at high frequencies and does not 
change significantly with age. This can be 
described by a Jeffreys fluid model (supplementary 
materials and fig. S8) (29). We determine 
λ values between 6 mPa·s and 1.3 Pa·s, 
which we attribute to the solvent. For FUS 
protein, we also observe an age-independent 
high-frequency viscosity in the same range 
(Fig. 4B and fig. S8, B and C). We conclude 
that at all conditions, the droplets exhibit 
Maxwell fluid rheology with age-dependent 
relaxation time. 
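
The Maxwell description used above can be made concrete with a short numerical sketch. It evaluates the standard single-relaxation-time Maxwell moduli, G' = E(ωτc)²/(1 + (ωτc)²) and G" = Eωτc/(1 + (ωτc)²), using the plateau modulus and relaxation time quoted in the Fig. 2B caption. The function and variable names are illustrative choices, not part of the original analysis.

```python
import math

def maxwell_moduli(omega, plateau_modulus, tau_c):
    """Return (G', G'') in Pa for a single-relaxation-time Maxwell element."""
    x = omega * tau_c
    storage = plateau_modulus * x**2 / (1.0 + x**2)
    loss = plateau_modulus * x / (1.0 + x**2)
    return storage, loss

# Fit values quoted in the Fig. 2B caption for an early PGL-3 droplet.
PLATEAU_MODULUS = 56.0  # Pa
TAU_C = 0.039           # s

omega_c = 1.0 / TAU_C  # crossover angular frequency, rad/s
g_c, g_loss_c = maxwell_moduli(omega_c, PLATEAU_MODULUS, TAU_C)
eta = 2.0 * g_c * TAU_C  # long-time viscosity, Pa*s (equals E * tau_c)

for factor in (0.1, 1.0, 10.0):
    storage, loss = maxwell_moduli(factor * omega_c, PLATEAU_MODULUS, TAU_C)
    if math.isclose(storage, loss):
        regime = "crossover"
    elif loss > storage:
        regime = "predominantly viscous"
    else:
        regime = "predominantly elastic"
    print(f"omega*tau_c = {factor:>4}: G' = {storage:6.2f} Pa, "
          f"G'' = {loss:6.2f} Pa ({regime})")

print(f"omega_c = {omega_c:.1f} rad/s, G_c = {g_c:.1f} Pa, eta = {eta:.2f} Pa*s")
```

At the crossover both moduli equal half the plateau modulus, which is why the long-time viscosity can be written either as Eτc or as 2Gcτc. Because the moduli depend on frequency only through the product ωτc, plotting G'/Gc and G"/Gc against ωτc collapses droplets of different ages onto one curve.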


Fig. 2. Active rheology of early and late 
PGL-3 droplets. A droplet is trapped between 
two beads using a dual optical trap; one trap is 
undergoing oscillatory displacement while the 
other trap is at a fixed position. (A) Force ΔF is 
shown as a function of time (gray) together 
with a sinusoidal fit (black solid line). The 
corresponding droplet deformation Δd (green) 
is shown together with a sinusoidal fit (solid 
green line). We determine the force amplitude 
F0, the deformation amplitude d0, and the 
phase shift φ = ωΔt from the fits, where ω is 
the imposed angular frequency. (B) Storage 
and loss moduli, G' and G", are shown as a 
function of frequency for an early droplet 
(tw = 0.5 hours with a surface tension of 
4.5 μN/m). The corresponding frequency-dependent 
moduli of a Maxwell element with a 
characteristic time scale τc = 0.039 s and 
plateau modulus E = 56 Pa are shown in 
gray. Error bars represent measurement 
uncertainty (supplementary materials). 
(C) Same plot as in (B) for the same droplet 
late after formation (tw = 1.5 hours with a 
surface tension of 19.3 μN/m). The complex 
elastic modulus of a Maxwell element with a 
characteristic time scale τc = 0.39 s and 
plateau modulus E = 50.7 Pa is shown in 
gray. Error bars represent measurement 
uncertainty (supplementary materials). (D) 
Scaled storage and loss moduli, G'/Gc and 
G"/Gc, are plotted as a function of scaled 
frequency ωτc for the droplets shown in 
(B) and (C) (blue, red) and additional 
droplets (colors indicate characteristic time 
τc). Gc is the crossover modulus. A scaled 
Maxwell fluid is shown in gray. For all 
panels, the storage moduli G' are shown as 
solid triangles, and the loss moduli G" are 
shown as open triangles. (B to D) Droplets 
were formed in a solution with a final KCl 
concentration of 75 mM. 

Using passive microrheology, we measured 
the mean square displacement over time of 
beads immersed in the droplets (Figs. 1H and 
3A and supplementary materials). We were 
concerned that the change in droplet radius 
would contribute to the mean square displace- 
ment of beads. Therefore, we first measured 
the droplet radius as a function of time (Fig. 
3B, orange line). Droplet shrinking can also be 
estimated on the basis of net particle motion 
(Fig. 3, A and B, blue line). These two measures 
show good agreement, with small discrepancies 
at later ages (Fig. 3B, fig. S5A, and supplementary 
materials). The average particle 
displacement is proportional to distance from 
the droplet center, indicating that the drop- 
let is shrinking uniformly in space (Fig. 3A). 
We quantified the mean square displacement 


$$\langle \Delta r(t, t_w)^2 \rangle = \langle [r(t_w + t) - r(t_w)]^2 \rangle \qquad (1)$$


of the beads over time t for different droplet 
ages tw, where r is the bead position in the 
image plane, after contributions from droplet 

Fig. 3. Passive microrheology of early and late PGL-3 droplets. (A) Image of 
particles representing bead positions in the upper half of a droplet several hours 
after formation (tw = 27 hours). Beads in yellow are considered boundary particles 
and are not considered for bead-diffusion analysis. The blue outline represents the 
droplet border shortly after droplet formation; the red outline represents the droplet 
border at tw = 27 hours. Particle displacements δx in the x direction resulting 
from droplet shrinkage are shown below for a time interval of t = 2.5 min at droplet 
age tw = 8 hours plotted with respect to x position in the droplet. The slope of these 
particle displacements determined by a linear fit (black line) characterizes average 
particle motion due to droplet shrinkage. Data points within a standard deviation of 
the linear fit are shown in red, the remaining ones in gray. (B) Scaled droplet radius, R/R0, 
is shown as a function of droplet age tw, where R0 is the initial droplet radius (orange). 
The scaled radius inferred from average particle displacements is shown in blue 
(supplementary materials). (C) Inverse of the diffusion coefficient D of embedded 
beads calculated from the mean square displacement during t = 20 min is shown as a 
function of waiting time tw. The characteristic relaxation time τc is shown in green. 
(D) Mean square displacement ⟨Δr²⟩ as a function of lag time t shown for different 
waiting times tw. (E) Double logarithmic plot of ⟨Δr²⟩ as a function of lag time t for 
different waiting times tw. (F) Plot of normalized mean square displacement, ⟨Δr²⟩/Δ0², 
as a function of normalized lag time t/τc, where Δ0² is the offset and τc is a characteristic 
time. The values of τc that correspond to the mean square displacements 
shown in (E) are chosen such that the data for different waiting times tw collapse 
onto a single curve, which is well described by ⟨Δr²⟩ = Δ0² (1 + t/τc) (gray line). 
(G to I) Real and imaginary parts, G0' and G0", of the inferred complex shear moduli 
G0* for waiting times tw = 3 hours (G), 38 hours (H), and 42 hours (I) (supplementary 
materials). (J) Normalized inferred shear moduli as a function of the normalized 
frequency ωτc for all measurements of mean square displacement. The colors 
indicate the waiting time tw, and the color code is the same as in (D) and (E). In 
addition, the value of characteristic times τc is indicated. The values of G0,c are 
inferred from the fit in (F) using the relationship G0,c = kBT/(3πaΔ0²). Inferred 
shear moduli shown in (G) to (J) were approximated using Eq. 14 of the supplementary 
materials. The gray lines in (G) to (J) correspond to the inferred complex 
moduli for a single-relaxation-time Maxwell fluid using the same approximation. 

Fig. 4. Dimensionless loss and storage moduli obtained by active and passive microrheology collapse onto 
single curves. (A) Shear moduli G* and G0* scaled by the respective crossover moduli Gc and G0,c plotted as a 
function of dimensionless frequency ωτc. The active microrheology data are shown as open and solid triangles. 
The moduli inferred from passive microrheology are shown as solid colored lines. These lines were obtained using 
passive microrheology data (Fig. 3F and supplementary materials). For active and passive microrheology, the 
same color code as in Figs. 2, B to D, and 3, E to J, is used. The thick gray lines represent the frequency response 
of a Maxwell fluid. (Inset a) Crossover moduli for active and passive microrheology, Gc (yellow triangles) and 
G0,c (colored circles), respectively, are plotted as a function of characteristic time, τc. (Inset b) Viscosities for 
active and passive microrheology, η = 2Gcτc (yellow triangles) and η0 = 2G0,cτc (colored circles), respectively, are 
plotted as a function of characteristic time, τc. Shear moduli inferred from passive microrheology were obtained 
using Eq. 18 of the supplementary materials. The gray lines correspond to the complex moduli of a single-relaxation-time 
Maxwell fluid. PGL-3 droplets were formed in a solution with a final KCl concentration of 75 mM. (B) Plot 
analogous to the upper plot with active and passive microrheology of FUS protein. FUS droplets were formed in a 
solution with final KCl concentrations of 75 mM and 250 mM for passive and active droplets, respectively. 

shrinkage have been subtracted. We also corrected 
for the noise floor due to experimental 
noise (supplementary materials and fig. S7). 
The angle brackets denote an average over beads 
and over an age window of 1 hour (supplementary 
materials). We found that at early ages tw, 
the mean square displacement increases linearly 
with time t and reaches ~0.4 μm² within 
10 min (Fig. 3, D and E, blue lines). For increasing 
droplet ages, the slope of the mean square 
displacement decreases (Fig. 3, D and E). The 
diffusivity can be determined through the slope 
of the mean square displacement as a function 
of time, ⟨Δr(t, tw)²⟩ ≈ Δ0² + 4D(tw)t, where Δ0² 
is an offset that is independent of t but can 
depend on age tw. D(tw) is an age-dependent 
effective diffusivity; inverse D(tw) is shown in 
Fig. 3C, black line. As the system ages, D(tw) decreases 
steadily but remains almost constant 
with D ~ 10⁻⁴ μm²/s for tw less than ~1000 min 
(15 hours). For passive, equilibrated samples, 
this value corresponds to a viscosity of ~1 Pa·s, 
which is consistent with previous estimates of 
PGL-3 droplet viscosity (26, 28). For ages tw 
greater than 1000 min, D(tw) decreases with 
age and reaches a value of D = 1.5 × 10⁻⁷ μm²/s 
at ~2500 min (Fig. 3C). For passive liquids, this 
corresponds to a viscosity of ~10³ Pa·s. When 
the mean square displacement curves are examined 
on a logarithmic scale, the curve 
reaches a finite value at short lag times, indicative of an 
elastic regime (Fig. 3, E and F). Consistent with 
the active microrheology, the mean square displacement 
data collapse to a single curve well 
described by a Maxwell fluid when the 
time t and ⟨Δr²⟩ are rescaled by the age-dependent 
relaxation time τc and the offset Δ0² (Fig. 3, D to 
F, and supplementary materials). 
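
The step from mean square displacement to viscosity can be checked with a small calculation. The sketch below assumes the two-dimensional relation MSD = offset + 4Dt and the Stokes-Einstein relation for a passive, equilibrated liquid. It uses the early-age displacement of about 0.4 μm² within 10 min from the text and the 1-μm-diameter beads from the Fig. 1 caption; the temperature of 295 K and the neglect of the offset are assumptions made only for this illustration.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K

def diffusivity_from_msd(msd_m2, lag_s, offset_m2=0.0):
    """Effective diffusivity from a 2D mean square displacement, MSD = offset + 4*D*t."""
    return (msd_m2 - offset_m2) / (4.0 * lag_s)

def stokes_einstein_viscosity(diffusivity_m2_s, bead_radius_m, temperature_k):
    """Viscosity of a passive, equilibrated liquid from the Stokes-Einstein relation."""
    return K_B * temperature_k / (6.0 * math.pi * bead_radius_m * diffusivity_m2_s)

MSD_M2 = 0.4e-12        # ~0.4 um^2 reached by beads in early droplets (from the text)
LAG_S = 10 * 60.0       # within 10 min (from the text)
BEAD_RADIUS_M = 0.5e-6  # 1-um-diameter beads (Fig. 1 caption)
TEMPERATURE_K = 295.0   # assumed room temperature; not stated in this section

# Assumption: the offset is neglected, so the whole displacement is treated as diffusive.
d_early = diffusivity_from_msd(MSD_M2, LAG_S)
eta_early = stokes_einstein_viscosity(d_early, BEAD_RADIUS_M, TEMPERATURE_K)

print(f"D ~ {d_early * 1e12:.1e} um^2/s")
print(f"eta ~ {eta_early:.1f} Pa*s")
```

With these inputs the diffusivity comes out at a few times 10⁻⁴ μm²/s and the viscosity at a few Pa·s, the same order of magnitude as the early-age values discussed above.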

Passive microrheology assesses fluid proper- 
ties from mean square displacement mea- 
surements using the fluctuation-dissipation 
relation (27, 30). The fluctuation-dissipation 
relation applies to equilibrium systems. However, 
an aging system has not reached equilibrium. 
We reasoned that for sufficiently short 
time scales compared with the age tw, quasi-equilibrium 
of many degrees of freedom may 
hold (31), and the fluctuation-dissipation relation 
may provide useful estimates of material 
properties. We therefore determined the 
frequency-dependent complex shear modulus 
G0* = G0' + iG0" inferred from the fluctuation-dissipation 
relation using the mean square 
displacement data (supplementary materials). 
The moduli G0* obtained this way qualitatively 
exhibit the frequency-dependent behaviors of 
a Maxwell fluid with age-dependent Maxwell 
times (Fig. 3, G to I). The frequency-dependent 
moduli G0*(ω) at 
different ages tw collapse on a single curve when 
the frequency axis is scaled with the time τc 
and the moduli are scaled by the value G0,c, 
defined as the value of the real and imaginary 
parts of G0* at the point where both are equal 
(Fig. 3J). Consistent with the active micro- 
rheology data, the passive microrheology data 
also exhibit at all ages the qualitative behaviors 
of a Maxwell fluid (compare Fig. 2D with Fig. 
3J). We also performed passive microrheology 
on protein condensates composed of FUS pro- 
tein and PGL-3-green fluorescent protein (GFP) 
(figs. S5 and S6). Our bead-tracking measure- 
ments on these condensates again reveal at all 
ages the qualitative properties of a single-time 
Maxwell fluid. 
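
The link between a mean square displacement that grows linearly from a finite offset and single-time Maxwell rheology can be written out explicitly. The following is a sketch, assuming the generalized Stokes-Einstein relation in its creep-compliance form for a bead of radius a tracked in two dimensions, and neglecting inertia and the solvent viscosity. The symbols E (plateau modulus), J(t) (creep compliance), and Δ0² (offset of the mean square displacement) are notation chosen here for the illustration.

```latex
\begin{align}
  J(t) &= \frac{1}{E}\left(1 + \frac{t}{\tau_c}\right)
  && \text{(Maxwell creep compliance)} \\
  \langle \Delta r^2(t) \rangle &= \frac{2 k_B T}{3\pi a}\, J(t)
   = \Delta_0^2 \left(1 + \frac{t}{\tau_c}\right),
  \qquad \Delta_0^2 = \frac{2 k_B T}{3\pi a E} \\
  G_c &= \frac{E}{2}
  \;\Longrightarrow\;
  G_c = \frac{k_B T}{3\pi a\, \Delta_0^2} \\
  4D &= \frac{\Delta_0^2}{\tau_c}
  \;\Longrightarrow\;
  D = \frac{k_B T}{6\pi a\, \eta},
  \qquad \eta = E\,\tau_c = 2 G_c \tau_c
\end{align}
```

Under these assumptions the offset sets the crossover modulus, the long-time slope recovers the ordinary Stokes-Einstein diffusivity, and rescaling time by τc and displacement by Δ0² maps every age onto the same curve.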

We highlight the similarity of active and 
passive microrheology by plotting the scaled 
shear moduli as a function of scaled frequency 
for both active (triangles) and passive (colored 
lines) microrheology in the same plot for both 
PGL-3 and FUS proteins (Fig. 4). The data col- 
lapse and are well captured by a single-time 
Maxwell fluid (Fig. 4, gray lines). When examining 
the moduli Gc obtained for active microrheology 
taken from many aging droplets, 
we find values that vary over approximately 
one order of magnitude [Fig. 4, A and B, insets 
(a), yellow triangles]. The values of G0,c obtained 
by passive microrheology for one 
aging droplet exhibit a trend toward smaller 
values for larger τc [Figs. 3, G to I, and 4A, inset 
(a), colored circles]. We also estimated the long-time 
viscosities, η = 2Gcτc and η0 = 2G0,cτc, for 
active and passive microrheology, respectively 
[Fig. 4A, inset (b)]. The viscosity strongly increases 
with τc, and both active and passive 
microrheology share the same trend. To compare 
the inferred moduli determined by the 
active and passive microrheology, we define the 
ratio ξ = ⟨Gc⟩/⟨G0,c⟩, where brackets denote 
the average over several droplets and several 
values of τc. Using the window of 0.004 s < 
τc < 0.442 s, we find ξ ≈ 2.63 ± 2.57. One can 
interpret ξ as an effective noise strength that 
replaces the amplitude of fluctuations in the 
fluctuation-dissipation relation (32-35). The 
product ξT is sometimes called an effective temperature. 
This suggests that the fluctuation-dissipation 
relation is only weakly violated, at 
least at early ages, similar to previous reports 
on soft glasses (32, 36). Our data further show 
that the elastic modulus does not tend to increase 
with time, suggesting that the term 
hardening is not appropriate for the change 
in material properties of condensates consist- 
ing of PGL-3 and FUS family proteins. Rather, 
we use the term aging. 

A defining and unexpected rheological fea- 
ture of the condensates we study is the self- 
similarity of their material properties as they 
age. This self-similarity is manifested by the 
fact that at all ages, they exhibit the same 
viscoelastic behavior of a Maxwell fluid. An 
ordinary Maxwell fluid does not age and 
reaches a thermodynamic equilibrium because 
it has a fixed relaxation time. However, the 
relaxation time of the protein condensates in 
this study increases with age. Age-dependent 
relaxation times are typical for glass-forming 
systems. We thus refer to a Maxwell fluid with 
age-dependent relaxation time as a Maxwell 
glass (supplementary materials). Our data sug- 
gest that the molecular components increas- 
ingly interact and lose their freedom to move 
as the condensates shrink and their density 
increases. Such behaviors are also seen in 
other glass-forming systems (15, 16, 37-40). 
The dependence of aging characteristics on 
salt concentration that we observe suggests 
that electrostatic interactions play a role in 
the aging of protein condensates (41). 

A key question will be to relate glassy behav- 
ior to molecular configurations. The associative 
polymer ideas of Semenov and Rubinstein are 
a beautiful and elegant way to understand 
viscoelastic and Maxwell-like behaviors of 
polymer systems and describe the equilibrium 
properties and kinetics of such systems (42). 
However, they do not address aging. An exam- 
ple of how dynamics can slow is jamming (43). 
However, this is a simplified picture based on 
excluded volume and rigid objects such as 
spherical particles. For condensate-forming 
proteins, a more appropriate picture may be 
based on the fact that these proteins have large 
regions of disorder (6). Disordered proteins can 
take many different molecular configurations 
leading to complex energy landscapes with 
distributions of energy minima (44). Increas- 
ing relaxation times are likely related to the 
system exploring deeper energy minima that 
are increasingly inaccessible, but longer lived, 
as the system ages (45-47). Such ideas have been 
put forward in the context of protein folding 
(44, 48, 49) but still need to be explored in the 
context of higher-order protein assemblies. 

In conclusion, we provide evidence that bi- 
ological condensates made from a variety 
of proteins can best be described as Maxwell 
glasses. At any particular age, the rheology 
reveals the expected viscoelastic properties 
of a Maxwell fluid, well described by a single 
relaxation time. However, at different ages, 
the relaxation times themselves are different. 
Therefore, the term Maxwell glass incorporates 
the aging of Maxwell fluids and glasses. The 
role of glass-like behaviors has not been well 
characterized in biology, which has tended to 
stress gel-like characteristics of biological 
materials (7, 50). This is because glass-like 
aging is a dynamic process and, therefore, 
more difficult to study because it requires a 
broad range of time scales. However, glass- 
like behavior offers a number of interesting 
possibilities for cellular function compared 
with a gel-like state. A soft gel will either re- 
spond elastically or break under stress and 
tends to require chemical modification for dis- 
assembly. On the other hand, because glasses 
can start to flow when mechanically stressed 
(51), glass-like soft pastes offer a tension sensor 
to cells that can couple mechanical stresses 
to biochemistry. The nature of glass-like states 
makes it easy to fluidize a biological glass sim- 
ply by changing composition, temperature, or 
salt concentration. More generally, soft glasses 
allow cells to slow down biochemistry while 
maintaining a soft material, which can flex- 
ibly and rapidly respond to changing condi- 
tions. Little is known about the relationship 
between pathology and stiffness of aggregates, 
but stiffer structures likely could be more dis- 
ruptive to the mechanical environment of a 
cell. Cells in tissues are constantly responding 
to mechanical perturbation from other cells in 
the tissue and must be able to respond as soft 
materials. Glass-like aging of protein conden- 
sates may, therefore, offer cells a way to flexibly 
modulate mechanical properties of membrane- 
less organelles while allowing for rapid re- 
sponse to changing environmental cues. 


Table 1. Characteristics of the proteins used in this study. The first four proteins are from the FUS family. Protein length is reported in number of amino 
acids. All molecular weights and sequence lengths are reported for untagged proteins. The addition of a GFP tag adds 29 to 31 kDa in molecular weight and 
260 to 278 amino acids in length. The GFP tag is on the N terminus for DAZAP1, whereas all other proteins have the GFP tag on the C terminus. 


Protein name   Length   Molecular weight [Da]   Plasmid name   Virus number   Gene
DAZAP1         407      43,384                  TH1272         1424           DAZAP1
EWSR1          656      68,480                  TH1259         1510           EWSR1
FUS            526      53,426                  TH1204         3311           FUS
TAF15          589      61,559                  TH1258         1480           TAF15
PGL-3          693      74,700                  pSS2B          1522           PGL3

ULTRACOLD CHEMISTRY 


Resonant collisional shielding of reactive molecules using electric fields 


Kyle Matsuda, Luigi De Marco, Jun-Ru Li, William G. Tobias, Giacomo Valtolina, 
Goulven Quéméner, Jun Ye 


Full control of molecular interactions, including reactive losses, would open new frontiers in
quantum science. We demonstrate extreme tunability of ultracold chemical reaction rates
by inducing resonant dipolar interactions by means of an external electric field. We prepared
fermionic potassium-rubidium molecules in their first excited rotational state and observed
a modulation of the chemical reaction rate by three orders of magnitude as we tuned the electric
field strength by a few percent across resonance. In a quasi-two-dimensional geometry, we
accurately determined the contributions from the three dominant angular momentum projections of the
collisions. Using the resonant features, we shielded the molecules from loss and suppressed the reaction rate
by an order of magnitude below the background value, thereby realizing a long-lived sample of polar
molecules in large electric fields.


Controlling chemical reactions and collisions has been a central focus of work
on cold and ultracold molecules (1-6). Progress in cooling and trapping molecules
has led to exciting advances in this
area, including the precise characterization of
scattering resonances (7-9), the observation
of atom-molecule (10) and molecule-molecule
(11-13) cold collisions, and the synthesis of
new chemical species (14). In particular, ultracold
polar molecules, for which both internal
and external degrees of freedom are controlled, 
present distinctive opportunities (15-26). At 
ultralow temperatures, small perturbations to 
the long-range intermolecular potential, although 
negligible relative to chemical bonding energy 
scales, can vastly exceed the kinetic energy of 
the colliding molecules and thus strongly alter 
the rate of chemical reactions at close range (3). 
This sensitivity, combined with the rich struc- 
ture of polar molecules and their tunability 
using external electromagnetic fields, suggests 
the exciting possibility of precisely controlling 
reactions (1, 4-6). In addition to providing 
insights about fundamental chemical processes 
(27, 28), such control would aid in the produc- 
tion of quantum-degenerate molecular gases 
(29-31) and could facilitate precision measure- 
ments (32) or studies of many-body physics 
(33) in these systems. 

Applying an electric field E strongly modifies the reaction rates of ultracold polar
molecules via dipolar interactions (34, 35). In the ultracold regime, molecules collide
predominantly in the lowest partial wave L allowed by quantum statistics (L = 1 for identical
fermions). Owing to the anisotropy of the dipolar interaction, the likelihood of two
molecules meeting at short range depends on how they approach each other relative to the
direction of the induced dipole, or more formally, on the projection m_L of L onto the axis of
E. For fermionic molecules in three dimensions (3D), attractive head-to-tail (m_L = 0)
collisions lead to rapid losses (34), which scale as d^6 in the induced dipole moment d (36). If
the molecules are instead trapped in a quasi-two-dimensional (quasi-2D) geometry with E
along the strongly confined direction, only repulsive side-to-side (m_L = ±1) collisions are
allowed, suppressing losses and enhancing the elastic collision rate (37, 38). For
ground-state potassium-rubidium (KRb) molecules, we recently used this approach to achieve a
ratio of elastic collisions to reactive collisions exceeding 100 (31).

Here, we experimentally demonstrate a 
striking effect of the electric field on molec- 
ular collisions: Chemical reaction rates in an 
ultracold gas of molecules are sharply varied 
by three orders of magnitude near particular 
values of the field strength |E|. These values 
occur where higher rotationally excited states 
become degenerate with the initial collision 
channel, inducing resonant dipolar interac- 
tions that profoundly alter the long-range 
potential and hence the reaction rate (39). 
Although losses can also be resonantly en- 
hanced, the most important effect is the sub- 
stantial suppression of loss for an appropriate 
choice of |E|. This shielding mechanism was 
first proposed by Avdeenkov et al. (40), and 
related theory was subsequently extended to a 
wide variety of bosonic and fermionic species 
of experimental interest (39, 41, 42). Alterna- 
tively, the use of microwave (43-47) or optical 
(48) dressing to suppress molecular loss has 
been proposed. However, microwave dressing
has so far only led to an enhanced loss rate in
experiments (49, 50).

We prepared ultracold fermionic ⁴⁰K⁸⁷Rb molecules in the |N, m_N⟩ = |1,0⟩ state, where
N is the rotational angular momentum and m_N is its projection onto the axis of E.
Throughout, |N, m_N⟩|N', m'_N⟩ denotes the combined molecular state of a pair of
molecules with one molecule in |N, m_N⟩ and the other in |N', m'_N⟩. We observed a drastic change
in the two-body reactive loss rate near two field strengths, |E_1| = 11.72 kV/cm and
|E_2| = 12.51 kV/cm, where the energies of |0,0⟩|2,±1⟩ and |0,0⟩|2,0⟩ (respectively) cross the
energy of |1,0⟩|1,0⟩ (Fig. 1A). Near these crossings (Fig. 1B), the nearly degenerate states are
resonantly coupled by dipolar interactions, becoming strongly mixed as the molecules
approach to separations r ~ r_0 during a collision event, where r_0 = 270 a_0 is the radius of
the p-wave centrifugal barrier (51) and a_0 is the Bohr radius.

The consequence of this r-dependent state mixing is apparent in the adiabatic energy
curves near |E_2| (Fig. 1C) (52). For |E| > |E_2| (orange line), the energy of |1,0⟩|1,0⟩ is higher
than that of |0,0⟩|2,0⟩. Hence, coupling between the states causes an increasing energy
of |1,0⟩ molecules as they approach, creating a repulsive barrier (with height ~ 300 µK) that
is three orders of magnitude larger than the typical collision energy set by the temperature
of the gas (250 nK). In this case, molecules are shielded from reactive losses, because they
cannot meet at short range except by tunneling through the barrier (which occurs with a
low probability). Conversely, for |E| < |E_2| (green line), the energy of |1,0⟩|1,0⟩ is lower
than that of |0,0⟩|2,0⟩, resulting in an attractive interaction and an enhanced loss rate. For
comparison, the diabatic energy curve (black line) shows the behavior in the absence of
resonant dipolar interactions. Although lossy collisions are suppressed for |E| just above
resonance, elastic dipolar collisions for KRb are predicted to be nearly unaffected by the
shielding (39). In general, the shielding is predicted to increase the ratio of elastic collisions
to reactive collisions, potentially allowing for efficient evaporative cooling of molecules in 3D
(42) and complementing the recently demonstrated evaporation in 2D (37).

This effect is akin to a Förster resonance,
for example between Rydberg atoms (53-55),
in which |E| is tuned to create degeneracies
between pairs of dipole-coupled states, result- 
ing in resonant energy transfer. A key difference 
is the much smaller dipole moment of molecules 
relative to that of Rydberg atoms. Consequently, 
colliding molecules experience an adiabatic 
increase in the dipolar interaction energy as 
they approach. We stress that this is not a conventional scattering resonance arising
from the presence of a molecule-molecule bound state, but rather a resonance between
two free scattering states enabled by the internal structure of the molecules (39).

Fig. 1. Electric field-induced shielding. (A) Energies of the relevant combined
molecular states as a function of |E|, displaying the crossing of |1,0⟩|1,0⟩ with
|0,0⟩|2,±1⟩ and |0,0⟩|2,0⟩. (B) Qualitative picture of the region near |E_2| (black
dotted line). For large molecular separations r >> r_0 (dashed lines), the two
states are not coupled. For separations r ~ r_0 (solid lines), an avoided crossing
is opened as a result of dipolar interactions between the states. As two |1,0⟩
molecules collide at a fixed |E| > |E_2| (orange dotted line), the avoided crossing results
in an effective repulsion; at a fixed |E| < |E_2| (green dotted line), the avoided crossing

The experimental setup has been described in detail previously (37). In brief, a degenerate
mixture of ⁴⁰K and ⁸⁷Rb was prepared in six layers of a 1D optical lattice, with final trap
frequencies (ω_x, ω_y, ω_z) = 2π × (34, 17.7 × 10³, 34) Hz for KRb in each layer (gravity points
along −y). Weakly bound molecules were created with a magnetic field ramp through an
interspecies Feshbach resonance at 546.62 G and were transferred to the ro-vibronic ground
state by stimulated Raman adiabatic passage (STIRAP) at |E_STIRAP| = 4.5 kV/cm with the
field along +y. For studying the |1,0⟩ state, it was advantageous to perform STIRAP at large
|E| to bypass several avoided crossings at |E| < 1 kV/cm that arise from the hyperfine
structure (21). Typical starting conditions were 2 × 10⁴ molecules in the |0,0⟩ state at a
temperature T = 250 nK, corresponding to about 1.8 times the Fermi temperature. With nearly
perfect occupancy of the lowest band (k_B T/ħω_y ~ 0.3, where k_B is the Boltzmann constant and
ħ is the reduced Planck constant) and negligible tunneling between lattice sites, our system
realized a stack of quasi-2D molecular gases.
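
The quoted ratio of thermal energy to the level spacing along the tightly confined direction can be checked from the stated temperature (250 nK) and trap frequency (17.7 kHz). The following is a minimal sketch, not the authors' analysis: the function names are illustrative, and the ground-band fraction assumes simple Boltzmann weights for a one-dimensional harmonic oscillator, ignoring quantum statistics and lattice anharmonicity.

```python
import math

# CODATA constants (SI units).
K_B = 1.380649e-23       # Boltzmann constant, J/K
HBAR = 1.054571817e-34   # reduced Planck constant, J*s


def thermal_ratio(temperature_k: float, trap_freq_hz: float) -> float:
    """Return k_B*T / (hbar*omega) for a trap frequency given in Hz."""
    omega = 2.0 * math.pi * trap_freq_hz
    return K_B * temperature_k / (HBAR * omega)


def ground_state_fraction(ratio: float) -> float:
    """Boltzmann population of the lowest level of a 1D harmonic oscillator.

    Assumption: weights exp(-n*hbar*omega/(k_B*T)); with x = hbar*omega/(k_B*T)
    the partition function is 1/(1 - exp(-x)), so P0 = 1 - exp(-x).
    """
    return 1.0 - math.exp(-1.0 / ratio)


ratio = thermal_ratio(250e-9, 17.7e3)  # T = 250 nK, omega_y = 2*pi * 17.7 kHz
fraction = ground_state_fraction(ratio)
print(f"k_B T / (hbar omega_y) = {ratio:.2f}")
print(f"ground-band fraction   = {fraction:.3f}")
```

Under these assumptions the ratio comes out close to 0.3 and more than 95% of the population sits in the lowest band, consistent with the description of the layers as quasi-2D.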

results in an effective attraction. The same argument applies near |E_1|. (C) Adiabatic
energy curves for KRb molecules in |1,0⟩ colliding with m_L = ±1 at |E| = 12.504 kV/cm
(green curve) and 12.670 kV/cm (orange curve), corresponding to the values of |E|
indicated by the green and orange vertical dotted lines in (B). The diabatic energy curve
at 12.670 kV/cm (black solid curve) shows a barrier (36 µK) from the sum of the
centrifugal (24 µK) barrier (51) and the semiclassical dipolar (12 µK) barrier (31, 37, 38)
in the absence of resonant dipolar interactions. The dashed line shows the average collision
energy of 250 nK. The red and blue spheres represent Rb and K, respectively.

Fig. 2. Experimental setup and control of reactions by the electric field orientation. (A) Schematic of
the experiment geometry. Owing to the strong confinement along y, the m_L states participating in the
collisions can be controlled with θ. (B) Contribution of the m_L = 0 (red) or m_L = ±1 (blue) channels to the
scattering as a function of θ. (C) β as a function of θ for a fixed bias strength |E| = 7.09 kV/cm. Error bars are
1 SE from fits to the two-body rate equation.

To measure the reactive loss of the |1,0⟩ state, we used the following protocol. Starting
at E_STIRAP, we first applied a microwave π-pulse to transfer the molecules from |0,0⟩ to |1,0⟩
with a Rabi frequency of 2π × 200 kHz and a typical efficiency above 95%. Any remaining
|0,0⟩ population was quickly lost in a few milliseconds via s-wave reactive collisions
with |1,0⟩. Next, E was ramped to its target configuration in 60 ms. After a variable hold
time t, E was ramped back to E_STIRAP in 60 ms. To image |1,0⟩ molecules, we applied another
microwave pulse to transfer the molecules back to the |0,0⟩ state, then used STIRAP to
transfer to the Feshbach state before imaging the molecules in time-of-flight expansion. A
constant magnetic field of 545.5 G was present during the measurements. We fitted the
measured average density n as a function of t to the solution of the two-body loss rate equation
dn/dt = −βn², where β is the two-body chemical reaction rate coefficient. In 2D, there is no
temperature increase associated with the two-body loss (31, 56), hence this rate equation is
simplified in comparison to 3D (29).
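
The two-body rate equation dn/dt = −βn² has the closed-form solution n(t) = n_0/(1 + β n_0 t), so 1/n grows linearly in time with slope β. The sketch below illustrates one simple way to extract β from a loss curve using that linearization. The text does not specify the fitting procedure beyond fits to the rate equation, so the least-squares line fit, the function names, and the noise-free data in arbitrary units are all assumptions made for illustration.

```python
def two_body_density(t, n0, beta):
    """Solution of dn/dt = -beta * n**2 with initial density n(0) = n0."""
    return n0 / (1.0 + beta * n0 * t)


def fit_two_body(times, densities):
    """Estimate (n0, beta) from a straight-line fit of 1/n versus t.

    Because 1/n(t) = 1/n0 + beta*t, the slope is beta and the
    intercept is 1/n0 (ordinary least squares, no weighting).
    """
    inverse = [1.0 / n for n in densities]
    count = len(times)
    mean_t = sum(times) / count
    mean_inv = sum(inverse) / count
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_inv) for t, y in zip(times, inverse))
    beta = sxy / sxx
    intercept = mean_inv - beta * mean_t
    return 1.0 / intercept, beta


# Hypothetical, noise-free data in arbitrary units (not measured values).
true_n0, true_beta = 1.0, 0.2
times = [0.0, 1.0, 2.0, 5.0, 10.0, 20.0]
densities = [two_body_density(t, true_n0, true_beta) for t in times]

n0_fit, beta_fit = fit_two_body(times, densities)
print(f"n0 = {n0_fit:.3f}, beta = {beta_fit:.3f}")
```

With real, noisy data a weighted nonlinear fit to n(t) itself would usually be preferable, since inverting the density amplifies the noise at late times where n is small.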

To fully characterize the shielding effect, we first show how to tune the angular
momentum character of the collisions by changing the orientation of E relative to the
quasi-2D planes. Previous measurements in quasi-2D with E oriented along the tightly
confined direction (y) showed a suppression of β at moderately large values of d owing to
repulsive m_L = ±1 dipolar collisions (31, 37). Here, we studied the dipolar anisotropy by
tilting E away from the y axis by an angle θ, which enabled controlled mixing of the
attractive m_L = 0 collisions into the scattering (Fig. 2A). Although the dipolar interaction
in general mixes higher partial waves into the scattering, contributions from |m_L| > 1 are
negligible for the relatively small values of d explored here (52). Although the collisions
always occurred along the x and z directions because of the strong confinement along y,
the angular momentum character of the collisions with respect to E changed with θ.
For θ = 0°, the collisions decomposed equally into the m_L = ±1 channels, which give equal
contributions to the collision cross section as a result of the azimuthal symmetry of the
dipolar interaction. For θ = 90°, collisions along x corresponded to m_L = 0 scattering, and
those along z were still an equal superposition of the m_L = ±1 channels. Hence, by
measuring β at θ = 0° and 90°, it was possible to extract the loss rate coefficients β_±1 and
β_0 associated with the m_L = ±1 and 0 channels, respectively. The full dependence
on θ (Fig. 2B) was calculated by considering the mixing of the m_L states under rotations (52).

Fig. 3. 3D characterization of the shielding effect. (A) β_±1 (blue circles) versus |E| extracted from loss measurements at θ = 0°. (B) β_0 (red squares) versus |E|
extracted from loss measurements at θ = 90° and 0°. In (A) and (B), solid lines are theoretical predictions for the experimental T = 250 nK and ω_y = 2π × 17.7 kHz with
no free parameters. Error bars are 1 SE from fits to the two-body rate equation. (C) Close-up of β_±1 in the region near |E_1| and |E_2| [gray shaded region of (A)].

Figure 2C shows the measured β for |1,0⟩ molecules at |E| = 7.09 kV/cm as θ was varied
over 180°. As expected, β increased with |θ| and reached a maximum at θ = ±90°, where
attractive m_L = 0 collisions dominated the loss rate. At 7.09 kV/cm, the relatively small value
of d = −0.12 D limited the maximum increase of β to only an order of magnitude, in contrast
to the much larger effect expected for larger |d| (57). Our electrode geometry permitted
excellent control of the curvature of E along z, except near θ = ±90° (fig. S1), where we
applied a small correction to the measured β to account for compression of the cloud
attributable to the increased curvature in this configuration (52).

Having controlled the angular momentum channels participating in the collisions, we
proceeded to explore the dependence of the |1,0⟩ reaction rate on E. We measured β at
both θ = 0° and 90° to extract β_±1 and β_0 as a function of |E| (52), as summarized in Fig.
3, A and B, respectively. For both values of θ, we calibrated |E| to a few parts in 10⁴ using
spectroscopy on the |0,0⟩ to |1,0⟩ transition.

In the background region (|E| = 1 to 11 kV/cm) away from resonance, we observed a slight
decrease in β_±1 and a corresponding increase in β_0. β_±1 reached a minimum and β_0 reached a
maximum near |E| = 7 kV/cm, in agreement with theoretical predictions (Fig. 3, A and B,
solid lines) (52). To understand the trends of β_±1 and β_0, we note that |d| is nonmonotonic
in the investigated range of |E| and reaches a maximum of 0.12 D at 7 kV/cm. Thus, the
trends of β_±1 and β_0 are consistent with the semiclassical picture of repulsive side-to-side
(m_L = ±1) or attractive head-to-tail (m_L = 0) dipolar collisions modifying the loss rate, as
previously measured for the N = 0 state (31, 37, 38). Away from resonance, our results illustrate the
universal nature of this semiclassical effect, depending only on the value of |d| and not
on the rotational state of the molecule, when resonant dipolar effects are not important.
In addition, the absence of any field-dependent Fano-Feshbach resonances is consistent with
universal (unit probability) loss at short range for the |1,0⟩ state (57), as previously measured
with N = 0 molecules (34, 35).

In the region near |E_1| and |E_2|, resonant off-diagonal dipolar couplings to |0,0⟩|2,±1⟩
and |0,0⟩|2,0⟩ become the dominant contribution, instead of the diagonal dipolar
interactions that determine β in the background region. Here, inelastic collisions, which result
in transitions to the nearby combined molecular state, are possible owing to the long-range
dipolar mixing of the states. Because only |1,0⟩ molecules were detected in the experiment,
the measured loss rate consisted of the sum of the inelastic and reactive rates, with the
inelastic rate predicted to be negligible except for within a small range of |E| near resonance
(39). The optimal shielding condition, where the overall loss is minimized, arises from a
competition between the changes in the height of the repulsive barrier and the inelastic loss
rate, with the inelastic rate falling off faster away from resonance (39).

We observed sharp features at |E_1| and |E_2| in both the m_L = ±1 and m_L = 0 channels, in
excellent quantitative agreement with the scattering theory predictions with no free
parameters (Fig. 3, A and B, solid lines) (52). We measured a maximum variation of β_±1 by a
factor of 300 ± 20, and a reduction in β_±1 by a factor of 8 ± 3 at the optimal shielding
condition (12.77 kV/cm) relative to the value away from the features (10.13 kV/cm). In the m_L = 0
channel, we observed a variation of β_0 by a factor of 1000 ± 400 near the features.
Comparing the measurements at the optimal shielding point (11.84 kV/cm) and away from the
features (11.32 kV/cm), we observed a maximum suppression of β_0 by a factor of 23 ± 10 below
its background value. (We excluded the point at 12.72 kV/cm from this analysis, because the
extracted β_0 was consistent with zero within our measurement precision.)

Fig. 4. Reaction shielding and enhancement near |E_2|. Molecular loss measurements at |E| =
12.50 kV/cm (green squares) and 12.67 kV/cm (orange circles) for θ = 0° are shown. The inset
shows the data for |E| = 12.50 kV/cm on an enlarged x axis. Solid lines are fits to the two-body
rate equation; error bars are 1 SE of independent measurements.


As opposed to the semiclassical nature of the loss suppression when dipolar molecules
are made to collide side-to-side (31, 37), the presence of the resonances in both the m_L = 0
and m_L = ±1 channels highlights the quantum nature of the shielding mechanism, which is
based on level repulsion between the two combined molecular states brought to
degeneracy by E. The effect occurs independently of the relative orientation of the dipoles,
because approaching with either m_L = 0 or m_L = ±1 causes dipolar mixing of the states. Our
quasi-2D measurements indicated that the shielding is present in all three of these
channels at the same values of |E|, even in the case of m_L = 0 where the dipoles approach in
attractive head-to-tail collisions. We thus fully expect that the shielding will also be effective
in 3D (39).

Figure 3C shows a close-up of β_±1 near |E_1| and |E_2|, emphasizing the narrow widths of
the features. An intuitive explanation for the observed widths, which were on the order of
tens of V/cm, comes from comparing the resonant dipolar interaction energy with the
Stark shift of the two crossing states. Away from resonance, the reaction rate is controlled
by the height of the p-wave centrifugal barrier, which occurs at r_0 (51). Near resonance, this
barrier is modified by the dipolar coupling V_dd between the states, with an approximate
energy scale of V_dd ~ d_0²/(4πε_0 r_0³) = h × (16 MHz), where d_0 = 0.574 D is the permanent
dipole moment, ε_0 is the permittivity of free space, and h is the Planck constant. The
differential Stark shift of |1,0⟩|1,0⟩ and |0,0⟩|2,0⟩ near 12 kV/cm is roughly
∂U/∂|E| = h × 215 kHz/(V/cm). This argument suggests a width on the order of
V_dd/(∂U/∂|E|) = 75 V/cm, in qualitative agreement with the exact result from
scattering calculations.
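
The order-of-magnitude width estimate above can be reproduced from the quoted inputs: the permanent dipole moment (0.574 D), the barrier radius (270 Bohr radii), and the differential Stark shift (215 kHz per V/cm). The sketch below is only an illustration of that arithmetic; the function name is hypothetical and the physical constants are rounded standard values.

```python
import math

# Physical constants (SI units, rounded).
DEBYE = 3.33564e-30         # C*m per debye
BOHR_RADIUS = 5.29177e-11   # m
EPSILON_0 = 8.8541878e-12   # F/m
PLANCK_H = 6.62607015e-34   # J*s


def dipolar_coupling_hz(dipole_debye: float, separation_bohr: float) -> float:
    """Return d^2 / (4*pi*eps0*r^3) expressed as a frequency in Hz."""
    d = dipole_debye * DEBYE
    r = separation_bohr * BOHR_RADIUS
    energy_joule = d ** 2 / (4.0 * math.pi * EPSILON_0 * r ** 3)
    return energy_joule / PLANCK_H


v_dd_hz = dipolar_coupling_hz(0.574, 270.0)   # d_0 = 0.574 D, r_0 = 270 a_0
stark_slope_hz_per_v_cm = 215e3               # differential Stark shift
width_v_per_cm = v_dd_hz / stark_slope_hz_per_v_cm

print(f"V_dd / h ~ {v_dd_hz / 1e6:.0f} MHz")
print(f"width    ~ {width_v_per_cm:.0f} V/cm")
```

With these rounded inputs the coupling comes out near 17 MHz and the width near 80 V/cm, the same scale as the 16 MHz and 75 V/cm quoted in the text; differences of this size are expected for a scaling argument.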

To underscore the enormous change in β under a small variation of |E|, Fig. 4 displays
two molecular loss curves in the vicinity of |E_2| at θ = 0°, corresponding to the largest
measured difference in β_±1. The values of |E| for the two curves, 12.50 kV/cm (green squares
and inset) and 12.67 kV/cm (orange circles), correspond to the adiabatic energy curves in
Fig. 1C. At |E| = 12.50 kV/cm, the loss rate was strongly enhanced and the molecules were
lost within 100 ms. In contrast, at |E| = 12.67 kV/cm, the molecules were shielded from loss and
~20% of the initial density was still detected after 20 s of hold time, thereby realizing a
long-lived gas of polar molecules in a large electric field.
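
As a rough illustration of what the shielded lifetime implies, the remaining fraction can be converted into an initial loss rate under the two-body model n(t)/n_0 = 1/(1 + β n_0 t). This is a sketch under that assumption only: it treats "~20% after 20 s" as exact, ignores any one-body loss, and the function name is hypothetical.

```python
def initial_loss_rate(remaining_fraction: float, hold_time_s: float) -> float:
    """Return beta*n0 in 1/s, inverted from n(t)/n0 = 1/(1 + beta*n0*t)."""
    return (1.0 / remaining_fraction - 1.0) / hold_time_s


rate_per_s = initial_loss_rate(0.20, 20.0)   # ~20% left after 20 s
half_density_time_s = 1.0 / rate_per_s       # time at which n = n0/2

print(f"beta*n0           ~ {rate_per_s:.2f} 1/s")
print(f"half-density time ~ {half_density_time_s:.1f} s")
```

Under these assumptions the density halves after several seconds, compared with complete loss within 100 ms at 12.50 kV/cm.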

We have demonstrated a method for control- 
ling reactive losses using an external electric 
field and find excellent agreement with theo- 
retical predictions. Our investigation of the 
m_L = 0 and m_L = ±1 collision channels strongly
suggests that the shielding remains effective 
in 3D geometry, without the need for an optical 
lattice to protect the molecules. Indeed, we have 
made preliminary observations of long-lived 
molecules in a crossed optical dipole trap at 
12.67 kV/cm. The shielding could be used to 
create a favorable ratio of elastic to reactive 
collisions for evaporative cooling, which 
would simplify future efforts to create quantum- 
degenerate molecular gases for species other 
than KRb (41, 42). These results provide long- 
lived quantum gases of polar molecules in 
strong electric fields that are ready to be used 
to explore a wide range of exciting many- 
body phenomena and quantum information 
applications.

Stem-like CD8 T cells mediate response of adoptive cell immunotherapy against human cancer

Sri Krishna, Frank J. Lowery, Amy R. Copeland, Erol Bahadiroglu, Ratnadeep Mukherjee,
Li Jia, James T. Anibal, Abraham Sachs, Serifat O. Adebola, Devikala Gurusamy,
Zhiya Yu, Victoria Hill, Jared J. Gartner, Yong F. Li, Maria Parkhurst, Biman Paria,
Pia Kvistborg, Michael C. Kelly, Stephanie L. Goff, Grégoire Altan-Bonnet,
Paul F. Robbins, Steven A. Rosenberg


Adoptive T cell therapy (ACT) using ex vivo-expanded autologous tumor-infiltrating lymphocytes (TILs)
can mediate complete regression of certain human cancers. The impact of TIL phenotypes on
clinical success of TIL-ACT is currently unclear. Using high-dimensional analysis of human ACT products,
we identified a memory-progenitor CD39-negative stem-like phenotype (CD39⁻CD69⁻) associated
with complete cancer regression and TIL persistence and a terminally differentiated CD39-positive
state (CD39⁺CD69⁺) associated with poor TIL persistence. Most antitumor neoantigen-reactive
TILs were found in the differentiated CD39⁺ state. However, ACT responders retained a pool of CD39⁻
stem-like neoantigen-specific TILs that was lacking in ACT nonresponders. Tumor-reactive stem-like
TILs were capable of self-renewal, expansion, persistence, and superior antitumor response in vivo.
These data suggest that TIL subsets mediating ACT response are distinct from TIL subsets enriched for
antitumor reactivity.


Cancer immunotherapies such as immune checkpoint blockade (ICB), adoptive
T cell therapy (ACT), and chimeric antigen receptor (CAR) therapy rely on
the targeted destruction of cancer cells
by potent antitumor T cells (1-6). Studies of
ACT using tumor-infiltrating lymphocytes (TILs) 
(termed TIL-ACT) and ICB indicate that anti- 
tumor responses can be mediated by T cells that 
target cancer cells through the recognition of 
mutated neoantigens presented on human 
leukocyte antigen (HLA) molecules (7-11). In 
multiple phase 2 clinical trials, TIL-ACT has 
been shown to mediate complete durable re- 
sponses in certain patients with metastatic 
melanoma as well as epithelial cancers that 
are conventionally considered to be weakly 
immunogenic (5, 12-14). 

Successful immunotherapy is influenced by 
several tumor-intrinsic factors, including tu- 
mor mutational burden, neoantigen burden, 
HLA type and expression, DNA damage repair 
capacity, and expression of the programmed 
death ligand-1 (PD-L1) protein (1, 2, 15-17). Re- 
cently, T cell-intrinsic factors have also been 
associated with ICB response in murine and 
human studies (18-27). This subset of self-renewing
stem cell-like TILs has been reported
to exist in distinctive intratumoral structures 
and is characterized by the expression of tran- 
scription factor 7 (TCF7), along with the lack of 
cell surface inhibitory markers such as CD39 
or TIM3 (8, 22). However, analyses of tumor- 
reactive populations have concluded that anti- 
tumor neoantigen-specific TILs are enriched in 
subsets defined by the expression of PD-1 or 
CD39 (23-29). Thus, there is a lack of consensus 
regarding the tumor-reactive TIL subset that is 
directly responsible for successful immuno- 
therapy including TIL-ACT. 

Although the influence of the state of T cell 
differentiation on ACT has been evaluated in 
murine studies (30), the phenotypic fitness land- 
scape of tumor-reactive TILs associated with 
the TIL-ACT response against human cancer 
is unclear. ACT using TILs in lymphodepleted 
human cancer patients provides an opportunity 
to investigate the T cell states responsible for 
cancer regression, because the T cell-enriched 
treatment product can be isolated and studied. 
We therefore compared the phenotypic differ- 
ences that could distinguish TIL-ACT infusion 
products administered to patients who had 
complete response to therapy [termed complete
responders (CRs), n = 24] from those
whose disease progressed after ACT [termed
nonresponders (NRs), n = 30] (Fig. 1A). These
retrospective TIL-ACT infusion products were 
from a cohort of patients who had stage IV 
metastatic melanoma and had been treated 
with autologous in vitro-expanded TILs that 
had not been selected on the basis of tumor 
reactivity (“unselected” TILs). Additionally, 
patients in this cohort had not experienced 
prior immunotherapies in the form of genetically
engineered T cell therapy or PD-1 blockade
that might influence T cell phenotype
(table S1 and Study Cohort Description section
of the materials and methods) (14, 37).

Initial single-cell analysis of a discovery set 
of 4.8 million infusion-product TILs from 7 CRs 
and 9 NRs by mass cytometry (CyTOF) revealed 
heterogeneous expression of 34 cell surface 
markers (fig. S1 and table S2). Supervised 
analysis of TIL CyTOF profiles identified clus- 
ters that appeared prevalent in either CRs or 
NRs (Fig. 1, B and C). Cluster 1, which was four 
times more abundant in CR infusion products 
relative to NR infusion products (corrected P = 
0.0264; Fig. 1D), corresponded to CD8⁺ T cells
with high expression of CD44, CD27, and CD28
and low expression of TIM3, characterized in
prior studies as memory-like (18) and stem-like
(22) T cells (Fig. 1C). Notably, cluster 1 also had 
low expression of inhibitory marker CD39 and 
T cell activation marker CD69 (Fig. 1C). Machine 
learning-based unsupervised clustering of 
T cell activation and/or exhaustion states to clas- 
sify patients according to their clinical outcome 
further confirmed CD69 and CD39 expression 
as two important features of most clinical 
relevance (fig. S2). Of note, cluster 2, with high 
levels of CD39 and CD69 (CD8⁺CD39⁺CD69⁺)
but lower expression of CD44, CD27, and 
CD28 (Fig. 1C), trended higher in NR infusion 
products than in CR infusion products, al- 
though this difference was not statistically 
significant (Fig. 1D). 

Flow cytometric analysis of the 16 discovery samples defining CD8⁺CD39⁻CD69⁻ cells as
members of cluster 1 recapitulated CyTOF data with high confidence (fig. S3, A and B). An
evaluation of an independent set of 38 infusion products (CRs n = 17, NRs n = 21) by
multiparameter flow cytometry revealed that the frequency of the CD8⁺CD39⁻CD69⁻ TIL
population was 2.5-fold higher in CR infusion products relative to NR infusion products (P =
0.0096, percentage of CD3), supporting the association between this subset and ACT
response (Fig. 1, E and F). Whereas the total number of infused T cells did not differ
significantly between CRs and NRs in this set of 38 patient samples, the total number of
infused CD8⁺CD39⁻CD69⁻ cells was fourfold higher in CR infusion products than in NR
infusion products (P = 0.0031; Fig. 1, G and H). When querying individual markers, there was
a modest trend toward an association between CD8⁺TIM3⁺ TILs and nonresponse to ACT
(fig. S3C). Analyzing patient survival from all 54 patients, we found that whereas the
absolute number of infused CD39⁺CD69⁺ T cells, which comprised most of the CD8⁺ TILs in
patient infusion products (fig. S3C), did not significantly affect melanoma-specific survival
(MSS) or progression-free survival (PFS) in this cohort (fig. S4), higher numbers of
infused CD8⁺CD39⁻CD69⁻ cells were significantly associated with improved PFS [P < 0.0001,
hazard ratio (HR) = 0.255, 95% confidence interval (CI) 0.1257 to 0.5186; Fig. 1I] and
MSS (P < 0.0001, HR = 0.217, 95% CI 0.101 to 0.463; Fig. 1J) in a dose-dependent manner
(fig. S4 tertile analysis, bottom row). Furthermore, the ratio of CD8⁺CD39⁻CD69⁻ to
CD8⁺CD39⁺CD69⁺ cells in the infusion products significantly affected MSS and PFS (fig. S5),
suggesting that the ACT responses in this cohort were associated with the infusion of
higher numbers of CD8⁺CD39⁻CD69⁻ cells and not fewer CD8⁺CD39⁺CD69⁺ TILs.
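
The reported hazard ratios and confidence intervals can be checked for internal consistency. If an interval is symmetric on the log scale (a Wald-type interval, which is an assumption here since the text does not state how the intervals were computed), the point estimate is the geometric mean of the bounds and the interval width gives the standard error of log(HR). The sketch below applies that check to the PFS and MSS values; the function name is illustrative.

```python
import math


def log_scale_summary(lower: float, upper: float, z: float = 1.96):
    """Return (point estimate, SE of log HR) implied by a confidence interval.

    Assumption: the interval is exp(log(HR) +/- z*SE), i.e. symmetric
    on the log scale, with z = 1.96 for a 95% interval.
    """
    log_lower, log_upper = math.log(lower), math.log(upper)
    estimate = math.exp((log_lower + log_upper) / 2.0)
    std_err = (log_upper - log_lower) / (2.0 * z)
    return estimate, std_err


pfs_hr, pfs_se = log_scale_summary(0.1257, 0.5186)   # reported HR = 0.255
mss_hr, mss_se = log_scale_summary(0.101, 0.463)     # reported HR = 0.217

print(f"PFS: implied HR = {pfs_hr:.3f}, SE(log HR) = {pfs_se:.3f}")
print(f"MSS: implied HR = {mss_hr:.3f}, SE(log HR) = {mss_se:.3f}")
```

Both implied estimates agree with the reported hazard ratios to within rounding of the published bounds, which suggests the numbers in the text survived extraction intact.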

To further explore the potential importance 
of CD8*CD39 CD69" TIL [cluster 1, double- 
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negative (DN)] and the infusion product- 
predominant CD8*CD39*CD69" TIL population 
[cluster 2, double-positive (DP)], we evaluated 
the transcriptome profile of these two subsets 
(fig. S6A). DN TILs had increased expression 
of quiescent T stem cell markers KLF2, TCF7, 
SIPRI, LEFI, IL7R, CD27, and SELL (CD62L), 

Fig. 1. Phenotypic landscape of TIL infusion products from patients with
metastatic melanoma treated with ACT. (A) Melanoma cohort patient infusion
products and schema used for this study. (B) t-SNE (t-distributed stochastic
neighbor embedding) plots of live CD45⁺CD3⁺ cell clusters of infusion products
from all patient infusion products (left), clusters from CR infusion products
only (in red) (middle), and clusters from NR infusion products only (in black)
(right). Cluster 1 represents the CD39⁻CD69⁻ DN population. (C) Heatmap of
scaled protein expression (columns) per each cluster (rows); red arrows indicate
CD69 and CD39 expression. The red asterisk indicates cluster 1. (D) Plot
showing the percentage of CR and NR infusion-product cells in each cluster.
Error bars indicate SEM. *P = 0.0264 from two-sided Wilcoxon rank sum test
adjusted by Bonferroni correction for all clusters. (E and F) Flow cytometry-based
independent validation of CR infusion products and NR infusion-product
samples (n = 38) showing the percentages of CD39⁻CD69⁻ (DN) cells of total CD8⁺
(E) and total CD3⁺ (F). Error bars indicate mean ± SEM. (G and H) Scatterplot
showing (G) the total number of infused cells and (H) total number of infused
CD8⁺CD39⁻CD69⁻ (DN) cells within CR infusion products and NR infusion products
(n = 38). *P < 0.05 and **P < 0.01 by two-sided Wilcoxon rank sum test. Error
bars indicate mean ± SEM. ns, not significant. (I) PFS of patients after infusion-product
administration (n = 54) separated by median total cell numbers infused
(left) and median CD8⁺CD39⁻CD69⁻ (DN) cell numbers infused (right). (J) MSS of
patients after infusion-product administration (n = 54) separated by median cell
numbers infused (left) and median CD39⁻CD69⁻ cell number infused (right). “Low”
indicates patients with infused cells less than median of the subgroup analyzed, 
and “high” indicates patients with cells infused greater than median of the subgroup 


log-rank Mantel-Cox test are shown. 

whereas DP TILs expressed CD38, MKI67, TNF,
and IFNG found in differentiated, activated
T cells (fig. S6B and table S3). Unsupervised
clustering of 20,672 CD8⁺ TILs by single-cell
transcriptome analysis (scRNA) from 10 patient
infusion products (5 CRs, 5 NRs) defined
eight clusters (Fig. 2, A and B). Analysis of the
distribution of TILs from CR and NR infusion
products indicated that they segregated into a
two-cluster solution (18) defined by S.Cluster.A
(Super Cluster A, comprising clusters C0, C2,
C5, C6, and C7) and S.Cluster.B (Super Cluster B,
comprising clusters C1, C3, C4, and
C8). Responder TILs were more frequent in
S.Cluster.A (P = 0.0317), whereas S.Cluster.B
largely comprised NR TILs (P = 0.03; Fig. 2,
B and C). TILs in S.Cluster.A were enriched
for DN gene signature (81% overlap; fig. S6C
and table S4), whereas TILs in S.Cluster.B
were enriched for DP gene signature (Fig. 2,
D and E). Consistent with these analyses, CR


Fig. 2. CD39⁻CD69⁻ CD8⁺ TILs in infusion products are in a memory progenitor
stem-like state. (A) t-SNE plot of all CD8⁺ TILs from CR and NR infusion
products (5 CRs, 5 NRs). (B) Frequency distribution of the percentage of total CD8⁺
T cells in each cluster (C1 through C8) for each patient infusion product. (C) Box
plot showing median percentage of S.Cluster.A (top) and S.Cluster.B (bottom)
within CR and NR infusion products with error bars showing minimum to maximum.
*P < 0.05 by two-sided Wilcoxon rank sum test is shown. (D) Heatmap of the
top 15 differentially expressed genes between S.Cluster.A and S.Cluster.B displayed
for each cell. All discriminating genes are listed in table S4. (E) t-SNE plot displaying
the top two quartiles of CD39⁻CD69⁻ (DN) and top two quartiles of CD39⁺CD69⁺
(DP) gene expression signature. (F) Each patient infusion product was scored by
DN and DP gene signature scores, and their mean scGSEA scores are plotted on the
y axis. ****P < 0.0001 by two-sided Wilcoxon rank sum test comparing the mean
DN and DP signature scores between CRs and NRs. Error bars indicate mean ± SEM.
(G) Intracellular TCF7 expression by flow cytometry in DN and DP subsets. The
histogram shows a representative patient infusion-product sample (top), and the box
plot shows the quantitation for 18 infusion products (bottom), with error bars
showing minimum to maximum. ***P < 0.001 by two-sided Wilcoxon rank sum test.
(H) Flow cytometry plots showing the phenotypes of DN, SP, and DP states
before and after anti-CD3 and anti-CD28 stimulation (Stim) for 120 hours of a
representative patient sample (left), and dot plots quantifying phenotypes of
daughter cells after stimulation of FACS DN parent (red) or FACS DP parent
(black) (right). ***P < 0.001 by two-sided Wilcoxon rank sum test, n = 6 infusion
products. Error bars indicate mean ± SEM. APC, allophycocyanin; FITC,
fluorescein isothiocyanate. (I) Enriched TF motifs in CD39⁻CD69⁻ (DN)
and CD39⁺CD69⁺ (DP) TILs from patient infusion products in the validation
set (n = 3 each) ranked by adjusted P values. The full list of significant TFs is
shown in table S5. (J) t-SNE clusters of CD8⁺ TILs from the melanoma ICB
cohort (18) (top) and t-SNE plot colored by top two quartiles of DN and DP gene
signatures (DN, red; DP, black) (bottom). (K) TILs from pre-ICB therapy were
scored by the top DN and DP gene signature scores, and mean scGSEA scores
are plotted on the y axis. Error bars indicate mean ± SEM. TILs from responding
lesions are in red, and cells from progressing lesions are in black. Cell numbers and
****P < 0.0001 by two-sided Wilcoxon rank sum tests are shown. (L) Clustered
correlation matrix of gene signatures from other studies (table S6) with DN and DP
scGSEA scores on pre-ICB cells from the cohort.

TILs and NR TILs scored by single-cell gene
set enrichment analysis (scGSEA) of DN and
DP gene signatures indicated that CR TILs had
higher DN scores on average, whereas NR TILs
had higher DP scores (P < 4.5 x 10°” for both;
Fig. 2F).

We observed lower cell surface expression
and coexpression of the exhaustion markers
PD-1 and TIM3 in the DN subset relative to single-positive
(SP) (CD39⁺CD69⁻ and CD39⁻CD69⁺)
and DP subsets (n = 39 infusion products; fig.
S7). Conversely, expression of memory markers
CD62L and CD27 and progenitor marker
SLAMF6 was increased in DN relative to DP
subsets with variable expression within SP
populations (fig. S7). Protein expression of the
memory stem-like T cell marker TCF7 was
fivefold higher in DN TILs relative to DP TILs
(P = 0.0002, n = 18 infusion products; Fig. 2G),
consistent with our scRNA analysis and other
studies (18, 19). Although DN TILs in infusion
products expressed higher levels of both
SLAMF6 and TCF7 transcripts (fig. S8A),
TILs coexpressing TCF7 and SLAMF6 were
not significantly enriched within DN subsets
(fig. S8B), and addition of SLAMF6 to DN as
a surface marker was comparable to DN alone
in delineating TILs associated with survival
benefit in this patient subset (fig. S8C). By
contrast, DP TILs had significantly higher transcript
and protein expression of the effector-
and exhaustion-associated transcription factor
TOX relative to DN TILs (fig. S8, A and B) (32).
Notably, although the transcriptional profile
of DN TILs resembled that of peripheral
blood stem cell memory (SCM) T cells (33, 34),
cell surface expression of the canonical SCM
markers CD95 and CCR7 was not associated
with the DN phenotype in patient TIL infusion
products (fig. S9).

To experimentally confirm that CD39⁻CD69⁻
infusion-product TILs were stem-like T cells,
we isolated and stimulated the DN, SP, and DP
subsets using anti-CD3 and anti-CD28 in vitro.
Upon T cell receptor (TCR) stimulation, DN
TILs underwent self-renewal and gave rise to
both SP and CD39⁺CD69⁺ DP populations,
whereas stimulated DP TILs largely remained
in the same DP state, indicating a DN-progenitor
state (n = 6 infusion products; Fig. 2H). Time-course
kinetics of the daughter populations
indicated that essentially all DN TILs underwent
CD69 activation, but a subset of DN TILs
(median = 17%) rested back to the CD39⁻CD69⁻
stem-like state 5 days after initial stimulation
(fig. S10). By contrast, both CD39⁺ parent populations
(CD39 SP and DP) appeared to terminally
differentiate into a stable CD39⁺CD69⁺
state after TCR stimulation (fig. S10). Although
DP and DN TILs secreted similar levels of
interleukin-2 (IL-2), interferon-γ (IFN-γ), granzyme
A (GZMA), granzyme B (GZMB), and
perforin in response to TCR stimulation, DN
TIL stimulation resulted in higher levels of secreted
IL-17A, tumor necrosis factor-α (TNF-α),
IL-4, sFasL, and granulysin than DP TILs (fig.
S11). Taken together, our results indicate that
ACT response-associated CD39⁻CD69⁻ TILs are
in a progenitor memory stem-like state capable
of differentiation into other subsets, whereas
the infusion product-dominant CD39⁺CD69⁺
TILs are in a terminally differentiated state.

To understand the key regulators of these
two TIL states in the infusion products, we
isolated matched DN and DP TILs from three
patient infusion products and analyzed their
epigenetic profiles using ATAC (assay for transposase
accessible chromatin) sequencing. We
queried the presence of 4314 human transcription
factor (TF) motifs in open chromatin
regions of DN and DP TILs (fig. S12 and table
S5). Open chromatin regions within stem-like
DN TILs were enriched for numerous binding
sites for SOX and C2H2-ZF family TFs including
KLF4, TCF7, LEF1, and TCF7L1 (Fig. 2I and
fig. S12), consistent with those of less differentiated
human T cells (8, 35). By contrast, chromatin
regions within DP TILs were extensively
enriched for multiple binding motifs of bZIP TFs
FOSL1, FOS, JUNB, and JUND, indicating that
epigenetic imprinting may be involved in maintaining
the state of terminal differentiation
resulting from chronic tumor cell activation
(Fig. 2I and fig. S12) (35, 36).

We inquired if TIL states in the infusion
products corresponded to those within the
fresh tumor by reanalyzing the transcriptomic
profiles of CD8⁺ T cells from a previous melanoma
ICB response study (18) using DN and
DP gene signatures described above (Fig. 2, J
to L). We found that before immune checkpoint
therapy, TILs from ICB-responding lesions
had higher CD39⁻CD69⁻ signature scores
(P = 1.9 x 10°) than nonresponding lesions,
whereas TILs from lesions that progressed
during ICB had higher CD39⁺CD69⁺ signature
scores (P < 2.2 x 10°’). Hierarchical clustering
correlation analysis with T cell dysfunctional
and progenitor gene signatures from
other recent studies (listed in table S6) revealed
that CD39⁻CD69⁻ TILs were most similar
to ICB response-associated memory
and progenitor-exhausted TILs, whereas
CD39⁺CD69⁺ TILs were highly correlated
with TOX⁺ terminally exhausted TILs that are
associated with poor ICB response (Fig. 2L)
(18, 19, 32, 37). We developed a phenotypic
fitness score, defined as the difference between
DN stem-like signature and DP differentiated
signature (DN minus DP), and scored
scRNA from ACT infusion products and melanoma
ICB TILs. Single-cell fitness scores
reconfirmed that ICB-responder TILs have
on average more stem-like fitness compared
with ICB-NR TILs (fig. S13). These data suggest
that stem-like TILs found in ACT responder
infusion products resemble those
found in ICB responses.
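
The phenotypic fitness score defined above is a simple per-cell difference, so it can be illustrated with a short Python sketch. The sketch assumes that DN and DP signature scores (for example, from scGSEA) have already been computed upstream for each cell; the function names, the tuple layout, and the toy values are hypothetical and are not taken from the study.

```python
from statistics import mean


def fitness_score(dn_score: float, dp_score: float) -> float:
    """Phenotypic fitness of one cell: DN signature score minus DP signature score."""
    return dn_score - dp_score


def mean_fitness_by_group(cells):
    """Average the per-cell fitness score within each group label.

    `cells` is an iterable of (group, dn_score, dp_score) tuples. This layout
    is an assumption made for the sketch, not a format used by the authors.
    """
    by_group = {}
    for group, dn_score, dp_score in cells:
        by_group.setdefault(group, []).append(fitness_score(dn_score, dp_score))
    return {group: mean(scores) for group, scores in by_group.items()}


# Hypothetical signature scores, not data from the study.
toy_cells = [
    ("responder", 0.5, 0.25),
    ("responder", 0.75, 0.25),
    ("nonresponder", 0.25, 0.5),
    ("nonresponder", 0.0, 0.5),
]
group_means = mean_fitness_by_group(toy_cells)
```

A positive mean indicates that a group of cells leans toward the stem-like DN signature, and a negative mean indicates that it leans toward the differentiated DP signature.
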

Previous studies on stem-like and terminally
differentiated TIL subsets were performed on
bulk TIL populations lacking specific analyses
of antitumor T cells (18, 22). Given recent findings
suggesting that neoantigen-specific TILs
are nearly exclusively CD39⁺, whereas CD39⁻
TILs represent bystander cells (23, 24, 27), we
sought to determine whether the CD39⁻CD69⁻
stem-like state associated with ACT response
exists within the neoantigen-reactive T cells
in infusion products. Using our previously described
tandem minigene neoantigen screening
platform (12, 13), we defined 26 HLA class
I-restricted neoantigens from infusion products
for phenotypic evaluation (n = 11 patients; figs.
S14 and S15 and table S7). Stem-like and differentiated
TIL subsets defined by CD39 and CD69
expression were detectable within HLA-neoantigen
tetramer⁺ TILs as shown in representative CRs
and NRs (Fig. 3, A and B). Combined analysis
of all 26 neoantigen-specific T cell populations
indicated that they largely existed in the
CD39⁺CD69⁺ terminally differentiated state,
consistent with other studies reporting CD39
enrichment of neoantigen-specific T cells (median
DP = 62.7%; Fig. 3C) (23).

Stratification of the data by ACT response
status, however, revealed that the frequency of
DN cells was significantly higher in neoantigen-specific
TILs in CR infusion products (median =
8.8%) than in NR infusion products (median =
0.5%, P = 12 x 10°*; Fig. 3D). CD39 as a single
marker indicated that CD39-negative (CD39⁻)
neoantigen-reactive TILs were significantly
higher in CR TILs relative to NR TILs (P =
1.86 x 10”; Fig. 3E). Adjusting for tumor-specific
cells significantly enhanced these differences
between CR and NR infusion products, wherein
neoantigen-specific DN TILs were 23-fold
higher in CRs (median = 2.3 x 10° cells), and
CD39⁻ TILs were 40.4-fold higher in CR (median
= 1.06 x 10° cells) (fig. S16). We found no
differences in the total number of neoantigen-reactive
TILs, or neoantigen-reactive CD39⁺
or DP TILs between CR and NR infusion
products (fig. S16). These results indicate that at
least in the context of melanoma ACT infusion
products, not all CD39⁻ TILs are bystander
T cells as previously reported (23) and that a
subset of tumor neoantigen-reactive TILs exist
in a CD39⁻ stem-like state in CRs. Furthermore,
these data indicate that enrichment of tumor-reactive
neoantigen-specific TILs in differentiated
subsets does not necessarily correspond
to their frequency in stem-like states.

To explore the heterogeneity in stem-like and
differentiated states of neoantigen-specific TILs
at the single-cell transcriptomic level, we performed
combined scRNA and scTCR sequencing
on infusion products from a CR (patient
3713) and an NR (patient 4000), with defined
neoantigen-reactive TCRs [Fig. 3, F to L, and
table S8 (38-41)]. The infusion product from
responder patient 3713 was dominated by two
major clusters, C0 and C1, in which C0 represented
the stem-like DN state and C1 represented
the differentiated DP state (Fig. 3F and
fig. S17, A and B). Projection of the 20 neoantigen-specific
TCR⁺ (NeoTCR⁺) clonotypes showed
a broad distribution among the two clusters
(Fig. 3, F and G). The immunodominant SRPX
mutation-specific NeoTCRs (SRPXmut) were
enriched in cluster C0, whereas other NeoTCRs
varied in their prevalence within the two clusters
(Fig. 3G). CDR3β sequencing of fluorescence-activated
cell-sorted (FACS) DN and DP infusion-product
TILs confirmed this observation (fig.
S17, C and D).

Finer analysis of SRPXmut NeoTCR⁺ single
cells indicated that 9 out of 10 of these clonotypes
(and 15 out of 20 of all NeoTCR clonotypes)
were enriched for positive phenotypic fitness

Fig. 3. Stem-like state within neoantigen-specific TIL populations from
CR and NR infusion products. (A) Representative CR infusion product showing
detection of neoantigen-specific tetramers (top) and CD39 and CD69
phenotypes of bulk CD8⁺ TILs and CD8⁺ tetramer⁺ TILs (bottom). (B) NR
infusion product with comparable numbers of neoantigens (top) and CD39/CD69
phenotypes of bulk CD8⁺ TILs and CD8⁺ tetramer⁺ TILs (bottom).
Numbers within quadrants for (A) and (B), listed under the tetramer category,
represent the percentage of the subpopulation within each CD8⁺ tetramer⁺ gate.
(C) DN, SP, and DP phenotypes within all 26 neoantigen-specific T cell
populations expressed as a percentage of tetramer⁺ cells of each individual
tetramer from 11 patients. CRs and NRs are combined for this data analysis
(CRs + NRs). *P < 0.05 and ****P < 0.0001 by two-sided Wilcoxon rank sum
test followed by Bonferroni correction for multiple comparisons. Bar indicates
median. (D) DN, SP, and DP phenotypes of 26 neoantigen-specific TILs
subdivided by response status (CRs versus NRs). **P < 0.01 and ***P < 0.001
by two-sided Wilcoxon rank sum test between CRs and NR tetramer⁺ cells
for each subset. The bars indicate median. (E) CD39-negative neoantigen-specific
tetramer⁺ TILs (CD39⁻ as a single marker) subdivided by response status (CRs
versus NRs). ****P < 0.0001 by two-sided Wilcoxon rank sum test. Bar indicates
median. (F) t-SNE plot of CD8⁺ TILs from patient 3713-CR infusion product showing


Krishna et al., Science 370, 1328-1334 (2020) 11 December 2020 


projection of stem-like DN and differentiated DP gene signatures (left) and
projection of neoantigen-specific TCRs colored by antigen specificity (right).
Dotted lines indicate the stem-like and differentiated clusters analyzed.
(G) Patient 3713-CR NeoTCR⁺ cells from stem-like C0 cluster and differentiated
C1 cluster displayed as a percentage of total NeoTCR⁺ cells for each neoantigen
specificity. (H) Each NeoTCR⁺ clonotype scored by their mean fitness score
(value of DN minus DP scGSEA scores). Positive scores indicate clonotype
enrichment in stem-like phenotypes, and negative scores indicate clonotype
enrichment in differentiated state. (I) Post-ACT persistence of the immunodominant
SRPXmut NeoTCR clonotypes from patient 3713-CR infusion product in
peripheral blood normalized to their initial frequency in the infusion product
(day 0). (J) t-SNE plot of CD8⁺ TILs from patient 4000-NR infusion product showing
projection of stem-like DN and differentiated DP gene signatures (left) and a
projection of neoantigen-specific TCRs colored by antigen specificity (right).
Dotted lines indicate the stem-like and differentiated clusters analyzed. (K) Each
NeoTCR⁺ clonotype (HIVEP2mut, AMPHmut) scored by their mean fitness scores
as described in (H). (L) Post-ACT persistence of the HIVEP2mut and AMPHmut
NeoTCR clonotypes from patient 4000-NR in peripheral blood normalized to their
frequency in the infusion product. Patient follow-up was stopped at 150 days
because of disease progression.

scores (Fig. 3H). Because prior studies have
suggested that T cell-intrinsic differences potentially
affect post-ACT clonotypic persistence
(33, 42), we analyzed the post-ACT peripheral
blood of patient 3713. TCR repertoire sequencing
revealed that most of the NeoTCR clonotypes
(18 of 20) persisted up to 75 months,
consistent with the idea of TIL expansion from
the progenitor stem-like state (for SRPXmut,
see Fig. 3I; for other NeoTCRs, see fig. S17E).
Among the 10 SRPXmut NeoTCRs targeting
the same neo-epitope, one clonotype (TCR-51)
with the lowest phenotypic fitness score declined
in frequency and became undetectable
only 3 months after treatment (Fig. 3, H and I).


Combined scRNA and scTCR sequencing of
the NR patient 4000 infusion product showed
two major stem-like clusters (C1 and C5) and
one major differentiated cluster (C0) (Fig. 3J
and fig. S18, A and B). However, NeoTCR⁺ cells
(HIVEP2mut and AMPHmut) were largely concentrated
in the differentiated C0 cluster (Fig.
3J and fig. S18, C to E). scGSEA scores indicated
that both NeoTCR clonotypes had negative
phenotypic fitness scores, suggesting
terminal differentiation (Fig. 3K). In stark
contrast to the 3713-CR persistent clonotypes,
patient 4000 NeoTCRs did not persist after
ACT infusion; rather, both infused NeoTCR
clonotypes rapidly declined in peripheral blood
after treatment (Fig. 3L). These results support
our prior observations and other published reports
that have linked cell therapy response to
posttreatment TCR persistence (33, 42, 43).

Although our results indicate that T cell- 
intrinsic phenotypic differences are associated 
with persistence after ACT, additional fac- 
tors such as TCR avidity against various tumor 
antigens might also play a profound role in 
T cell persistence. To address this issue, we 
performed an exploratory analysis of T cell 
persistence in a patient who had metastatic 
synovial cell sarcoma and experienced a com- 
plete response after the adoptive transfer of 
autologous peripheral blood mononuclear 

Fig. 4. Stem-like TILs mediate antitumor activity and TCR persistence.
(A) Schema for using endogenous human TCR to track NY-ESO-1 TCR-transduced
(ESO.TCR⁺) infusion product in posttreatment peripheral blood of a CR patient to
NY-ESO-1 TCR therapy (ESO.CR). (B) Posttreatment peripheral blood persistence of
top 20 ESO.TCR⁺ infusion product clones using endogenous human TCR
according to enrichment in DN (in red) and DP (in black) phenotypes in the TCR
infusion product. Clones with frequency in DN/frequency in DP > 1 are defined
as enriched in DN state in the infusion product; clones with frequency in
DN/frequency in DP < 1 are defined as enriched in DP state in infusion products.
(C) Violin plot with median showing distribution of clones undetectable at day 7
after ACT (nonpersistent) (D7) compared with long-term persistent clones
at day 1846 after ACT (D1846) by the ratio of their clonotypic frequency from
the infusion product in the CD39⁻CD69⁻ stem-like state to their clonotypic
frequency within the CD39⁺CD69⁺ differentiated state. **P < 0.01 by two-sided
Wilcoxon rank sum test. (D) Schema for adoptive cell transfer of sorted DN and
DP Pmel-transgenic T cells into mice bearing established B16 melanoma tumors.
(E) Tumor growth curves of mice bearing B16 tumors treated with Pmel DN
or DP T cells in two doses. n = 6 mice per group. *P < 0.05 and **P < 0.01 for
tumor growth kinetics calculated by Wilcoxon rank sum test. Data show mean ±
SEM at indicated time points. (F) Tumor survival curves of mice bearing B16
tumors treated with Pmel DN or DP T cells in two doses. (G) Illustration of the
role of stem-like T cells in immunotherapy and ACT success and paradoxical
nature of tumor mutation-reactive T cells in stem-like and terminally differentiated
states. Neo.Ag, neoantigen.

cells transduced with a TCR targeting a single
HLA-A*02:01-restricted NY-ESO-1 epitope. We
used endogenous genomic human TCRβ as
barcodes to track stem-like DN clones and
terminally differentiated DP clones within the
TCR-transduced population in posttreatment
blood (Fig. 4A) (44). Fourteen out of the top
20 ESO TCR⁺ infusion-product clonotypes
were enriched in the stem-like DN state, whereas
six were enriched in the DP state. Stem-like
clones demonstrated a gradual decrease
in their frequency over a period of 5 years
after TCR transfer therapy, whereas the differentiated
clones declined rapidly to minor
frequencies in posttreatment blood of the patient
(Fig. 4B). Among 1311 top common clones,
long-term persistent clones (detectable at 51
months after ACT, n = 790) were present at
significantly higher frequencies in the stem-like
state in the infusion product when compared
with poorly persistent clones (i.e., those
undetectable at 7 days after treatment, n =
122) (P = 0.0035; Fig. 4C). These analyses indicate
that intrinsic stem-like phenotypes can
modulate the behavior of T cell clonotypes after
ACT, although larger patient cohort studies in
TCR and CAR settings are needed to support
these hypotheses.
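
The clone classification used in Fig. 4B (frequency in DN divided by frequency in DP, compared against 1) can be written as a small Python sketch. The function name and the handling of the two edge cases (a ratio of exactly 1 and a clone absent from the DP fraction) are assumptions added for illustration; the caption does not specify them.

```python
def classify_clone(freq_dn: float, freq_dp: float) -> str:
    """Label a clone by its DN-to-DP frequency ratio in the infusion product.

    Ratio > 1 -> enriched in the DN state; ratio < 1 -> enriched in the DP state.
    """
    if freq_dp == 0:
        # Assumption: a clone seen only in the DN fraction counts as DN-enriched,
        # and a clone seen in neither fraction is left undetermined.
        return "DN-enriched" if freq_dn > 0 else "undetermined"
    ratio = freq_dn / freq_dp
    if ratio > 1:
        return "DN-enriched"
    if ratio < 1:
        return "DP-enriched"
    # Assumption: a ratio of exactly 1 is not assigned to either state.
    return "undetermined"


# Hypothetical clone frequencies, not data from the study.
labels = [classify_clone(dn, dp) for dn, dp in [(0.02, 0.01), (0.01, 0.04)]]
```
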

Our findings suggested that treatment with
CD39⁻CD69⁻ tumor-reactive TIL might result
in superior tumor control. We first tested this
hypothesis in vitro by isolating tumor-reactive 
DN and DP populations from a CR infusion 
product (patient 3733) by coculture with the 
autologous 3733-mel tumor line to select for 
tumor-reactive TILs, followed by multiple 
rounds of rapid expansion to evaluate for 
their proliferative potential and tumor rec- 
ognition (fig. S19A). We found that the ex- 
pansion of the tumor-reactive stem-like DN 
subsets was ~1000-fold higher than the differ- 
entiated DP subset (fig. S19B). The stem-like 
DN subset maintained tumor recognition, 
whereas the differentiated DP subset lost tu- 
mor reactivity after subsequent rounds of ex- 
pansion (fig. S19, C and D). 

To assess the in vivo antitumor effects of
stem-like DN and differentiated DP subsets,
we isolated CD39⁻CD69⁻ and CD39⁺CD69⁺ CD8⁺
T cell populations from in vitro-expanded Pmel-transgenic
TCR splenocytes and performed
adoptive transfer of the isolated subsets into
mice implanted with B16-F10 melanoma engineered
to express the human gp100 antigen
(Fig. 4D) (45). Whereas the adoptive transfer
of either 3 x 10° or 5 x 10° differentiated DP
Pmel T cells had a modest effect on tumor 
control and mouse survival, the transfer of the 
same numbers of stem-like DN T cells led to 
substantial tumor regression and improved 
survival in a dose-dependent manner (Fig. 4, E 
and F). 

Harnessing antitumor T cell reactivity has
accelerated immunotherapy treatment strategies
against multiple human cancer types
(9, 28, 46), but many disease settings currently
exist outside the realm of approved immunotherapy
modalities. In this study, we have
explored the phenotypic diversity of antitumor
neoantigen-specific T cells in a setting
of successful immunotherapy. Our results support
the hypothesis that responders to TIL-ACT
received infusion products containing a
pool of stem-like neoantigen-specific TILs that
are able to undergo prolific expansion, give rise
to differentiated subsets, and mediate long-term
tumor control and T cell persistence, in
line with recent murine ICB studies mediated
by TCF⁺ progenitor T cells (18, 20, 27). Our data
also suggest that antitumor neoantigen-specific
TIL subsets enriched for tumor reactivity (e.g.,
CD39⁺) are likely terminally differentiated TILs
with a relatively poor proliferative potential
(Fig. 4G), likely owing to chronic antigenic stimulation
(35, 47). Although we cannot conclude
that CD39⁺ differentiated subsets are completely
ineffective because 3 out of 24 patients experienced
complete response with largely terminally
differentiated DP TILs (<2% DN TILs of
CD3⁺ in the infusion products) (Fig. 1F), it is also
possible that very low numbers of DN clones
were sufficient to cause complete tumor regression
in these patients (48). Our results argue that
strategies aimed at the isolation and expansion
of stem-like neoantigen-specific T cells, or
the engineering of T cells to have stem-like attributes,
might provide opportunities for future
development of more effective T cell-based
immunotherapies. Within tumor-reactive TIL
populations, the lower frequency of stem-like
TILs compared with their largely differentiated
counterparts, even in CRs, highlights the paradox
and challenges of mounting an effective
antitumor immune response.

INDIAN MONSOON


Indian monsoon derailed by
a North Atlantic wavetrain


P. J. Borah¹, V. Venugopal¹,²,³*, J. Sukhatme¹,², P. Muddebihal¹, B. N. Goswami⁴


The forecast of Indian monsoon droughts has been predicated on the notion of a season-long 
rainfall deficit linked to a warm equatorial Pacific. Here we show that nearly half of all 
droughts over the past century differ from this paradigm in that they (i) occur when Pacific 
temperatures are near-neutral and (ii) are subseasonal phenomena, characterized by an 
abrupt decline in late-season rainfall. This severe subseasonal rainfall deficit can be 
associated with a Rossby wave from mid-latitudes. Specifically, we find that the interaction 
of upper-level winds with an episodic North Atlantic vorticity anomaly results in a wavetrain 
that curves toward East Asia, disrupting the monsoon. This atmospheric teleconnection 
offers an avenue for improved predictability of droughts, especially in the absence of
telltale signatures in the Pacific.


Embedded in the interannual and intraseasonal
variability of Indian summer
monsoon rainfall (ISMR) (1-4) are complex
space-time patterns pertaining to
its extreme states, namely floods and
droughts (3, 5, 6). Given that the socioeconomic
fabric of one-sixth of the world’s population
is intricately tied to the state of the
monsoon (7), these extremes, especially droughts,
have a devastating impact on agriculture and
the economy. Against this backdrop, despite
its high potential predictability, the forecast
of ISMR one season in advance remains a big
challenge (8-10). For instance, it is known
that anomalously warm waters in the equatorial
central and eastern Pacific (El Niño) are
associated with some ISMR droughts (11-15).
However, what drives droughts not associated
with an El Niño has remained mostly unknown.
Further, the prevailing notion is that
ISMR droughts are characterized by large-scale
season-long rainfall deficit. Indeed, a
deeper insight into droughts by way of
their subseasonal evolution is essential not
only for a better understanding of these extremes
but also, practically, for targeted improvement
of general circulation models in
realizing the potential for improved ISMR
prediction.

Over the past century, India has experienced
23 droughts and, as identified in table S1, 13
of these occurred with, and 10 without, an El


¹Centre for Atmospheric and Oceanic Sciences, Indian
Institute of Science, Bangalore 560012, India. ²Divecha
Centre for Climate Change, Indian Institute of Science,
Bangalore 560012, India. ³Interdisciplinary Centre for Water
Research, Indian Institute of Science, Bangalore 560012,
India. ⁴Department of Physics, Cotton University, Guwahati
781001, India.

*Corresponding author. Email: venu.vuruputur@gmail.com or
venu@iisc.ac.in


Fig. 1. Sea surface temperature and 
seasonal rainfall anomalies for the two 
types of Indian monsoon droughts. 
Spatial distribution of anomalies of 
detrended JJAS SST for the two types of 
Indian monsoon droughts. (A) EN + Dr and 
(B) NEN + Dr. The detrending is based on 
removal of a linear trend for the period 
1901-2015 at each grid location, and the 
maps shown are the average of fluctuations 
around the trend, for the years listed in 
table S1. Based on one-degree monthly 
SST product from the Hadley Centre. 
Color bar is in degrees Celsius. (C) Inter- 
annual variation of anomalies of seasonal 
(JJAS) monsoon rainfall based on the 
Indian Institute of Tropical Meteorology 
homogeneous Indian monthly rainfall 
dataset for the period 1901-2015. The 
horizontal red line at -10% denotes the
threshold for defining a drought, as
per the India Meteorological Department
(IMD). The departures marked in red
(blue) indicate droughts that occurred
during an El Niño (no El Niño) year.

See methods in the SM. 
Niño. These two types of droughts are hence-
forth referred to as EN + Dr and NEN + Dr
(16), respectively. This sea surface temperature 
(SST)-based classification is confirmed in 
Fig. 1, A and B, which shows the mean of 
detrended SST anomalies during June through 
September (JJAS) for the years listed in table 
S1. Both types of droughts (Fig. 1C), although 
significantly different from normal years, are 
statistically indistinguishable from each other 
at a 5% significance level (P = 0.09) on a seasonal 
scale (fig. S1). This similar final seasonal state 
in the two categories of droughts, in the face of 
disparate oceanic conditions (warm versus 
neutral Pacific), prompts us to investigate their 
subseasonal evolution. 
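
The classification described above rests on two simple operations: removing a linear trend from a time series, and comparing the seasonal rainfall anomaly with the -10% drought threshold. The following is a minimal sketch of both steps, not the authors' code. The function names, the sample SST values, and the choice to treat an anomaly of exactly -10% as "no drought" are assumptions made for illustration.

```python
from statistics import mean


def detrend(values):
    """Return residuals after removing a least-squares linear trend."""
    times = list(range(len(values)))
    t_bar, v_bar = mean(times), mean(values)
    slope = (sum((t - t_bar) * (v - v_bar) for t, v in zip(times, values))
             / sum((t - t_bar) ** 2 for t in times))
    return [v - (v_bar + slope * (t - t_bar)) for t, v in zip(times, values)]


def classify_season(rain_anomaly_pct, el_nino_year, threshold_pct=-10.0):
    """Label a season using the -10% drought threshold (assumed strict)."""
    if rain_anomaly_pct >= threshold_pct:
        return "no drought"
    return "EN + Dr" if el_nino_year else "NEN + Dr"


# Hypothetical JJAS-mean SST values (degrees Celsius) at one grid point.
sst = [26.0, 26.1, 26.2, 26.9, 26.4]
residuals = detrend(sst)

# Hypothetical seasonal rainfall anomalies (percent) with an El Niño flag.
labels = [
    classify_season(-17.0, True),
    classify_season(-13.0, False),
    classify_season(-4.0, True),
]
```

In a gridded analysis the same `detrend` call would be applied independently at each grid location before compositing over the drought years.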
Fig. 2. Daily rainfall anomalies and their
cumulative sums for the two types of droughts.
(A) Temporal evolution of anomalies of daily
area-averaged rainfall over central India (16.5°N
to 26.5°N, 74.5°E to 86.5°E; land only) during
JJAS, for the two categories of droughts, along
with their respective leading harmonics (~120,
60, and 40 days; thick red and blue curves).
The thin red and blue curves shown are an
average of daily anomalies of 13 EN + Dr and
10 NEN + Dr years listed in table S1.
(B) Cumulative daily rainfall shown in (A).
The average daily rainfall anomalies for a
normal year (based on 18 randomly chosen
years with a seasonal rainfall anomaly between
-5 and 5%) and their cumulative sum are shown as
a gray line in both panels. Based on IMD
gridded one-degree daily rainfall data from
1901 to 2015. See methods in the SM.


Borah et al., Science 370, 1335-1338 (2020) 


Figure 2A shows the temporal behavior of 
daily anomalies of area-averaged central India 
rainfall [16.5°N to 26.5°N, 74.5°E to 86.5°E; see 
the methods section in the supplementary 
materials (SM)] during JJAS, for the two 
drought types, in comparison to normal years. 
The daily anomalies for each of the drought 
years are shown in fig. S2, from which the 
averages shown in Fig. 2A are estimated. The 
difference in the evolution of the two drought 
types is immediately evident. Specifically, in 
the EN + Dr category (Fig. 2A, thin red curve), 
negative rainfall anomalies, that is, departures 
from the daily climatological mean, set in during 
mid-June and persist till early August. On the 
other hand, in NEN + Dr (Fig. 2A, thin blue 
curve), anomalies have more rapid and intense 
fluctuations culminating in a sharp drop (see 
long break highlighted by the blue ellipse in 
fig. S2B) during mid-August. It is this break in 
late August that drives the monsoon into a 
state of drought. The envelopes (Fig. 2A, thick
curves) of these two types of droughts, re-
presented by their respective leading har-
monics (~120, 60, and 40 days), reinforce
this distinction.
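
The "leading harmonics" envelope can be illustrated with a short Fourier reconstruction. The sketch below assumes a 122-day JJAS series, so that the first three harmonics have periods of 122, 61, and about 41 days, matching the quoted ~120, 60, and 40 days. The function name, the synthetic input, and the decision to add the seasonal mean back in are illustrative assumptions; the authors' exact procedure is described in their SM.

```python
import cmath
import math


def leading_harmonics(series, n_harmonics=3):
    """Reconstruct a series from its mean plus the first few Fourier harmonics."""
    n = len(series)
    smooth = [sum(series) / n] * n
    for k in range(1, n_harmonics + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(series)) / n
        for t in range(n):
            # Harmonic k and its conjugate partner combine into one real wave.
            smooth[t] += 2 * (coeff * cmath.exp(2j * math.pi * k * t / n)).real
    return smooth


N_DAYS = 122  # 1 June to 30 September
periods = [N_DAYS / k for k in (1, 2, 3)]  # 122, 61, and about 40.7 days

# Synthetic daily anomalies: a slow seasonal wave plus a fast 12-day wiggle.
daily = [math.cos(2 * math.pi * t / N_DAYS)
         + 0.5 * math.cos(2 * math.pi * 10 * t / N_DAYS)
         for t in range(N_DAYS)]
envelope = leading_harmonics(daily)
```

Here the fast component is filtered out and `envelope` retains only the slow seasonal wave, which is the role the thick curves play in Fig. 2A.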

The difference in timing and duration of 
the breaks in these two categories is amplified 
in Fig. 2B, which shows the respective com- 
posites of the cumulative daily rainfall anomalies 
(see fig. S3 for individual years). A comparison 
of these composites also shows that NEN + Dr 
has a larger overall deficit than EN + Dr until 
early July. In sharp contrast, during the core 
of the monsoon season (July to mid-August), 
while the deficit persists and deepens in EN 
droughts, rainfall is near to above normal in 
NEN + Dr. This apparent recovery in the 
latter category is interrupted by the sudden 
break in mid-August. Although the timing of 
the deficits may be different, the rate of change 
(drop) in both categories is ~2 to 2.5 mm per 
day (0 to -100 mm in ~45 days during June 
and July for EN + Dr versus -25 to -75 mm in
20 days during mid to late August for NEN + 
Dr). The final state of the two curves in Fig. 2B 
is in agreement with the fact that, on average, 
the magnitude of the seasonal deficit is larger 
in EN + Dr (17%) than in NEN + Dr (13%), as 
can be inferred from table S1. While the dis- 
cussion so far has revolved around the rainfall 
in central India, the contrast holds for all of 
India as well (fig. S4). 
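
The quoted rate of ~2 to 2.5 mm per day follows directly from the end points read off the cumulative curves. A small check, using only the numbers given in the text (the helper name is an assumption for illustration):

```python
def mean_rate(start_mm, end_mm, days):
    """Average change in cumulative rainfall anomaly, in mm per day."""
    return (end_mm - start_mm) / days


# EN + Dr: 0 to -100 mm in about 45 days during June and July.
en_rate = mean_rate(0.0, -100.0, 45)

# NEN + Dr: -25 to -75 mm in 20 days during mid to late August.
nen_rate = mean_rate(-25.0, -75.0, 20)

print(f"EN + Dr:  {en_rate:.2f} mm/day")
print(f"NEN + Dr: {nen_rate:.2f} mm/day")
```

Both rates are negative because the cumulative anomaly is falling; their magnitudes are roughly 2.2 and 2.5 mm per day.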

The temporal evolution of rainfall in the 
two categories of droughts has an associated 
spatial footprint. Figure 3 shows the spatial 
patterns of the composites of cumulative daily 
rainfall anomalies on a 20-day time scale for 
the two categories of droughts. What stands 
out here is that in EN + Dr, negative rainfall 
anomalies appear near the foothills of the 
Himalayas and Gangetic plains (also, West- 
ern Ghats) and then propagate southward. 
Once the deficit sets in, it persists and covers 
most of the country by the end of July. Despite
the minor recovery during August in the north 
central plains—corresponding to positive rain- 
fall anomalies in Fig. 2A or, equivalently, the 
flattening of the cumulative (red) curve in 
Fig. 2B—the deficit during June and July is 
sufficient for the season to end in a drought. 
In contrast, for NEN + Dr, below-normal 
rainfall in June is largely confined to the west 
central plains. This is followed by a period 
when most of the country experiences above- 
normal rainfall (Fig. 3, I and J, mid-July to 
mid-August). This path to a near-normal state 
is derailed by a deficit that appears in the 
central plains and rapidly expands to cover 
the entire country within 20 days (Fig. 3K, 
20 August to 8 September). Our analysis thus 
clearly shows that while EN droughts are a 
consequence of a deficit that persists through 
the season, NEN droughts are subseasonal 
phenomena with a moderate early-season 
deficit compounded by an abrupt decline in 
late-season rainfall. 

The apparent stranglehold of the equatorial 
Pacific on Indian monsoon variability (in par- 
ticular, droughts) has been extensively docu- 
mented (13, 14, 17). Here we focus on the 
droughts belonging to the latter category 
(NEN + Dr), which occur in the absence of 
anomalously warm waters in the Pacific (Fig. 
1B). Specifically, we investigate the late-August 
break that derails the monsoon, by consid- 
ering the period when the gradient of rain- 
fall decline is steepest (Fig. 2, blue curves). 
The anticyclonic lower-level circulation over 
India confirms the break period (Fig. 4, C and 
D). Looking outward from India, a standout 
feature in the atmosphere is the presence of a 
strong, upper-level cyclonic circulation to the 
west, in the North Atlantic region (Fig. 4A). 
Note that this summertime feature is deep 
tropospheric (fig. S5) (18, 19) and has a ten- 
dency to occur episodically in June and August 
with a duration of 2 to 3 weeks (fig. S6B, blue 
curve; see also the supplementary text section in 
the SM). 

The interaction of upper-level winds and 
this vorticity anomaly, as seen in the 200 mbar 
meridional wind anomalies (shading in Fig. 
4A), indicates a wavetrain to the east of the 
Atlantic that extends all the way to East Asia 
(20, 21). At 500 mbar (Fig. 4B), the wavetrain 
curves toward the equator, with a prominent 
signature over the Indian region, apparently 
guided by the Tibetan plateau (the narrowing 
of the wind anomalies to the east of 60°E in 
Fig. 4B). Finally, the anticyclonic nature of 
wind anomalies over northern India, signs 
of which are evident at 500 mbar, becomes 
clearer at lower levels (700 and 850 mbar; 
Fig. 4, C and D). This circulation anomaly 
not only weakens the zonal flow over the 
Arabian Sea (i.e., disrupting the Somali jet) 
but also strengthens and spreads to cover all 
Fig. 3. Spatial distribution of cumulative rainfall anomalies on a 20-day time scale. (A to F) EN + Dr (top two rows) and (G to L) NEN + Dr (bottom two rows).
The anomalies shown are composites based on 13 EN + Dr and 10 NEN + Dr years, respectively (see table S1). Color bar is in millimeters. 


of India, as one descends from 700 to 850 mbar. 
Contrasting this with the low-level cyclonic 
circulation over India 20 days prior to the break 
(fig. S7) highlights the abruptness of this 
phenomenon (Fig. 3, J and K). Thus, the late- 
season decline in rainfall can be associated 
with a large-scale Rossby wave from the North 
Atlantic to the Indian subcontinent. 

The wide-ranging influence of mid-latitudes 
on the tropics includes the initiation and vari- 
ability of the Indian monsoon (22-24), monsoon 
bursts in Australia (25), and short-duration 
extremes in both hemispheres (26). The atmo- 
spheric teleconnection and its impact identified 
in our work are consistent with the recent 
recognition of the role played by mid-latitude
Rossby waves in affecting tropical climate at 
seasonal and shorter-duration time scales 
(26-28). In addition, our diagnostic analysis 
brings forth two intriguing aspects that call for 
further scrutiny: First, most NEN droughts 
seem to coincide with years when the North 
Atlantic was anomalously cold (fig. S6A); and, 
second, there appears to be a significantly 
higher likelihood of episodes of cyclonic vor- 
ticity anomalies over cold summer waters (see 
supplementary text, fig. S6B, and table S2). 
Both of these associations are statistical in 
nature and do not imply causality, especially as 
it is difficult to excite a large-scale atmospheric 
response to SST anomalies in the mid-latitudes
(29, 30). Together, they raise the possibility 
that there might be an overarching common 
element behind the excitation and sustenance 
of cold SSTs and uncharacteristically long dura- 
tion vorticity anomalies during the summer in 
the North Atlantic region. 

The association of the Atlantic basin with 
the Indian monsoon in particular has been 
explored in great detail, albeit mostly on 
decadal and interannual time scales (31-36). 
However, the subseasonal atmospheric bridge 
identified in this study represents a poten- 
tially new pathway in our understanding 
of monsoon droughts, especially when the 
Fig. 4. Wind and vorticity anomalies during the late-August break in NEN droughts. Composites
of anomalies of (top two panels) wind (vectors) and meridional velocity (shading) at (A) 200 mbar and
(B) 500 mbar during the first half of the break (days 81 to 90; blue curve in Fig. 2B) in the Indian 
monsoon rainfall; and (bottom two panels) wind (vectors) and vorticity (x 10°; shading) at (C) 700 mbar 
and (D) 850 mbar, based on the respective break periods late in the season for each of the NEN + Dr 
years. The gray lines in the 500 mbar panel represent topography. Based on ERA 20th Century 
Reanalysis data. See methods in the SM. 


equatorial Pacific temperatures are near- 
neutral. More pertinently, it points to the 
need for expanding the existing tropics- 
centered paradigm of monsoon droughts 
into a framework that includes extratropical 
teleconnections (37). 
Zoonotic introduction of novel coronaviruses may encounter preexisting immunity in humans. Using
diverse assays for antibodies recognizing SARS-CoV-2 proteins, we detected preexisting humoral
immunity. SARS-CoV-2 spike glycoprotein (S)-reactive antibodies were detectable using a flow
cytometry-based method in SARS-CoV-2-uninfected individuals and were particularly prevalent in
children and adolescents. They were predominantly of the immunoglobulin G (IgG) class and targeted
the S2 subunit. By contrast, SARS-CoV-2 infection induced higher titers of SARS-CoV-2 S-reactive
IgG antibodies targeting both the S1 and S2 subunits, and concomitant IgM and IgA antibodies,
lasting throughout the observation period. SARS-CoV-2-uninfected donor sera exhibited specific
neutralizing activity against SARS-CoV-2 and SARS-CoV-2 S pseudotypes. Distinguishing preexisting
and de novo immunity will be critical for our understanding of susceptibility to and the natural
course of SARS-CoV-2 infection.


Immune cross-reactivity among seasonally
spreading human coronaviruses (HCoVs)
has long been hypothesized to provide ef-
fective but transient cross-protection against
distinct HCoVs (1, 2). To determine the de-
gree of cross-reactivity between HCoVs and
SARS-CoV-2, we developed a flow cytometry- 
based assay for SARS-CoV-2-binding anti- 
bodies. The main target for such antibodies is 
the spike glycoprotein (S), which is proteolyti- 
cally processed into the S1 and S2 subunits, 
mediating target cell attachment and entry, 
respectively. 

The S1-specific CR3022 antibody stained a
smaller percentage of SARS-CoV-2 S-expressing 
human embryonic kidney (HEK) 293T cells 
and with lower intensity than COVID-19 con- 
valescent sera (fig. S1), indicating that polyclo- 
nal immunoglobulin G (IgG) antibodies targeted 
a wider range of epitopes naturally processed 
and displayed on these cells. This assay also 
detected SARS-CoV-2 S-reactive IgM and IgA 
antibodies in COVID-19 convalescent sera 


(fig. S2). Indeed, the presence of SARS-CoV-2 
S-reactive antibodies of all three Ig classes 
(IgG⁺IgM⁺IgA⁺) distinguished COVID-19 sera
from control sera with a high degree of sen- 
sitivity and specificity (Fig. 1A and fig. S3). All 
156 seroconverted COVID-19 patients had con- 
temporaneous IgG, IgM, and IgA responses 
to SARS-CoV-2 S throughout the observation 
period, with the exception of two patients who 
only had IgG antibodies (figs. S4 and S5). One 
of these patients was a bone marrow trans- 
plantation recipient who experienced HCoV 
infection 1 month before SARS-CoV-2 infec- 
tion (fig. S6). Unexpectedly, a small proportion 
of SARS-CoV-2-uninfected patients sampled 
before or during the early spread of SARS- 
CoV-2 in the United Kingdom (table S1) also 
had SARS-CoV-2 S-binding IgG antibodies, 
but not IgM or IgA antibodies (Fig. 1A), sug- 
gesting the presence of cross-reactive immu- 
nological memory. 

The S2 subunit exhibits a higher degree of 
homology among coronaviruses than S1 (fig. 


S7) and was likely the main target of cross- 
reactive antibodies. Competition with re- 
combinant soluble S1 or S2 at doses that 
blocked binding of specific monoclonal anti- 
bodies (fig. S8) did not affect the frequency 
of cells stained with COVID-19 patient sera, 
although the intensity of staining was re- 
duced by 31 and 37%, respectively (Fig. 1, B 
to D), indicating recognition of both S1 and 
S2. By contrast, soluble S2 completely abol- 
ished staining with SARS-CoV-2-uninfected 
patient sera, whereas soluble S1 had no effect 
(Fig. 1, B to D). Thus, SARS-CoV-2-uninfected 
patient sera cross-react with SARS-CoV-2 
S2, and COVID-19 patient sera additionally
recognize S1.

SARS-CoV-2 S-reactive IgG antibodies were 
detected by flow cytometry in five of 34 SARS- 
CoV-2-uninfected individuals with HCoV 
infection confirmed by reverse transcription- 
quantitative polymerase chain reaction, as 
well as in one of 31 individuals without re- 
cent HCoV infection (Fig. 2A and fig. S4A). 
This suggested that cross-reactivity may have 
persisted from earlier HCoV infections rather 
than having been induced by the most re- 
cent one. 

To confirm antibody cross-reactivity using 
an independent assay, we developed enzyme- 
linked immunosorbent assays (ELISAs) using 
recombinant SARS-CoV-2-stabilized trimeric 
S ectodomain, S1, receptor-binding domain 
(RBD), or nucleoprotein (N). Rates of IgG sero-
positivity by SARS-CoV-2 S1-coated ELISA
were congruent with, but generally lower 
than, those by flow cytometry (fig. S9). The 
three SARS-CoV-2-uninfected individuals 
with the highest cross-recognition of S by 
flow cytometry, plus an additional four in- 
dividuals, had ELISA-detectable IgG anti- 
bodies against the SARS-CoV-2 S ectodomain, 
as well as N (Fig. 2A and fig. S4, B to D). By 
contrast, none of the control samples had 
ELISA-detectable IgG antibodies against the 
less-conserved SARS-CoV-2 S1 or RBD (Fig. 2A
and fig. S4, B to D). 

The prevalence of such cross-reactive anti- 
bodies was further examined in additional 
healthy donor cohorts (table S1). Among 50 
SARS-CoV-2-uninfected pregnant women 
sampled in May of 2018, five showed evidence 
for SARS-CoV-2 S-reactive IgG antibodies, but 
not IgM or IgA antibodies (Fig. 2B and fig. 
Fig. 1. Flow cytometric detection and specificity of antibodies reactive with SARS-CoV-2 S.
S10). In a separate cohort of 101 SARS-CoV-2-
uninfected donors sampled in May of 2019,
three had SARS-CoV-2 S-reactive IgG anti- 
bodies (fig. S11) that did not correlate with 
antibodies to the diverse viruses and bacte- 
ria also present in several of these samples. 


SARS-CoV-2 S-reactive IgM and IgA were 
also detected in two of these donors, albeit at 
considerably lower levels than in COVID-19 
patients (fig. S11), suggestive of recent or on- 
going response. In an additional cohort of 13 
donors recently infected with HCoVs, only one 
Fig. 3. Neutralization of SARS-CoV-2 S pseudotypes and authentic SARS-CoV-2 by SARS-CoV-2-
infected and -uninfected patient sera. (A) Inhibition of transduction efficiency of SARS-CoV-2 S or VSVg
pseudotypes by sera from adult COVID-19 patients who seroconverted (SARS-CoV-2⁺ Adults Ab⁺) or not (SARS-CoV-2⁺
Adults Ab⁻) and SARS-CoV-2-uninfected adult donors (SARS-CoV-2⁻ Adults Ab⁺) or children and adolescent
donors (SARS-CoV-2⁻ Children/Adolescents Ab⁺) with SARS-CoV-2 S-binding antibodies. Each line is
an individual serum sample. (B) Authentic SARS-CoV-2 neutralization titers of sera from the same donors as
in (A), as well as SARS-CoV-2-uninfected donors without SARS-CoV-2 S-binding antibodies (Ab⁻). Dots
represent individual samples. *P = 0.037; **P = 0.014; ns, not significant by one-way ANOVA on ranks.
had SARS-CoV-2 S-reactive IgG antibodies,
and these were at very low levels (fig. S12). 
This suggested that their emergence was not 
simply a common transient event after each 
HCoV infection in this age group (median age 
51 years; table S1). Instead, given that HCoV- 
reactive antibodies are present in virtually all 
adults (3-5), the rarity of SARS-CoV-2 S cross- 
reactivity (16 of 302; 5.29%) indicates addi- 
tional requirements such as random B cell 
receptor repertoire focusing or frequency of 
HCoV infection rather than time since the last 
HCoV infection. Indeed, the frequency of HCoV 
infection displays a characteristic age dis- 
tribution, being the highest in children and 
adolescents (1, 4-8). We therefore examined
a cohort of younger SARS-CoV-2-uninfected 
healthy donors (age 1 to 16 years; table S1) 
sampled between 2011 and 2018. At least 21 
of these 48 donors had detectable levels of 
SARS-CoV-2 S-reactive IgG antibodies (Fig. 2, 
C to E), whereas only one of an additional co- 
hort of 43 young adults (age 17 to 25 years; table 
S1) had such antibodies (Fig. 2F). Staining with 
sera from SARS-CoV-2-uninfected children 
and adolescents was specific to HEK293T cells 
expressing SARS-CoV-2 S, but not the unre- 
lated HERV-K113 envelope glycoprotein, and 
was outcompeted by soluble SARS-CoV-2 S2 
(fig. S13). The prevalence of SARS-CoV-2 S- 
reactive IgG antibodies peaked at 62% be- 
tween 6 and 16 years of age (Fig. 2F), when 
HCoV seroconversion in this age group also 
peaks (3, 4, 6, 7), and was significantly higher 
than in adults (P < 0.00001, Fisher’s exact test). 
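
The comparison between age groups can be illustrated with a two-sided Fisher's exact test written against the standard library. This is a hedged sketch rather than the authors' analysis. The 2 x 2 table below is an assumption built from counts quoted in the text (21 of 48 children and adolescents versus 16 of 302 adults), and the groups the authors actually compared may differ. The function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact P value for the table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x positives in the first row.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more likely than the observed one.
    return sum(p for p in (prob(x) for x in range(lowest, highest + 1))
               if p <= p_observed * (1 + 1e-9))


# Assumed table: rows are age groups, columns are S-reactive IgG yes/no.
children_pos, children_neg = 21, 48 - 21
adults_pos, adults_neg = 16, 302 - 16

adult_prevalence_pct = 100 * adults_pos / 302      # about 5.3%
child_prevalence_pct = 100 * children_pos / 48     # about 43.8%
p_value = fisher_exact_two_sided(children_pos, children_neg,
                                 adults_pos, adults_neg)
```

Under this assumed table the P value falls well below the 0.00001 bound quoted in the text.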

To determine the potential consequences 
of antibody cross-reactivity, we examined 
the ability of preexisting antibodies to inhibit 
SARS-CoV-2 entry into HEK293T cells (fig. S14 
and supplementary text). Although not ex- 
pected to directly inhibit RBD-mediated cell 
attachment, S2-targeting antibodies that can 
neutralize SARS-CoV-2 have recently been 
discovered (9, 10). HEK293T cell infection 
with SARS-CoV-2 S pseudotypes was efficiently 
inhibited by sera from seroconverted (Ab⁺)
COVID-19 patients, but not from those who
had not yet seroconverted (Ab⁻) (Fig. 3A). Sera
from SARS-CoV-2-uninfected donors with SARS- 
CoV-2 S-reactive antibodies also neutralized 
these pseudotypes, whereas none of the sera 
neutralized vesicular stomatitis virus (VSV) 
glycoprotein pseudotypes (Fig. 3A). Comparable 
neutralization of SARS-CoV-2 S pseudotypes 
was also observed with sera from SARS-CoV- 
2-uninfected adolescents (Fig. 3A). Moreover, 
most of the sera from SARS-CoV-2-uninfected 
donors with flow cytometry-detectable cross- 
reactive antibodies also neutralized authen- 
tic SARS-CoV-2 infection of Vero E6 cells, 
albeit on average less potently than COVID- 
19 patient sera (Fig. 3B). By contrast, sera 
from SARS-CoV-2-uninfected patients with- 
out cross-reactive antibodies exhibited no 
Fig. 4. Mapping of cross-

neutralizing activity (Fig. 3B). Antiviral anti-
bodies may also enhance viral entry by Fc
receptor-mediated antibody-dependent en- 
hancement. However, entry of SARS-CoV-2 S 
pseudotypes was not enhanced by either 
COVID-19 patient sera or SARS-CoV-2-
uninfected patient sera in FcγRIIA-expressing
K-562 cells (fig. S15).

Collectively, these findings highlight func- 
tionally relevant antigenic epitopes conserved 
within the S2 subunit. Over its entire length, 
SARS-CoV-2 S exhibits marginally closer ho-
mology with the S proteins of the betacorona-
viruses HCoV-OC43 and HCoV-HKU1 than
with the alphacoronaviruses HCoV-NL63 and
HCoV-229E (fig. S16A). To probe shared epi- 
topes, we constructed overlapping peptide 
arrays spanning the last 743 amino acids of 
SARS-CoV-2 S (fig. S16B). Multiple putative 
epitopes were differentially recognized by sera
with cross-reactive antibodies (Ab⁺), were rea-
sonably conserved, and most mapped to the
surface of S2 (Fig. 4, A and B, and table S2). An 
epitope overlapping the S2 fusion peptide was 
also recently identified as being cross-reactive 
with the corresponding peptides from HCoV-
OC43 and HCoV-229E (11). Cross-reactivity
with the identified epitopes was further sup- 
ported by ELISAs coated with synthetic pep- 
tides (fig. S17). 

As expected (3-5), reactivity with one or 
more HCoVs was detectable by flow cytometry 
in all sera (Fig. 4D and fig. S18). However, IgG 
and IgA reactivity against HCoVs was higher 
in SARS-CoV-2-uninfected adults with SARS- 
CoV-2-reactive IgG compared with those with- 
out (P = 1.4 x 10° for IgG and P = 0.017 for IgA, 
Student’s t test) and in SARS-CoV-2-uninfected 
children or adolescents with SARS-CoV-2- 
reactive IgG compared with those without (P = 
0.010 for IgG and P = 0.021 for IgA, Student’s 
t test) (Fig. 4D), supporting a direct link be- 
tween the two. Accordingly, IgG reactivity 
against each HCoV type was independently 
correlated with the presence of SARS-CoV-2- 
reactive antibodies (Fig. 4D). 

Our results from multiple independent as- 
says demonstrate the presence of preexisting 
antibodies recognizing SARS-CoV-2 in unin- 
fected individuals. Identification of conserved 
epitopes in S2 targeted by neutralizing anti-
bodies may hold promise for a universal vac-
cine protecting against current as well as future 
CoVs. Together with preexisting T cell (12-14) 
and B cell (1, 15) memory, antibody cross-
reactivity between seasonal HCoVs and SARS-
CoV-2 may have important ramifications
for natural infection. Epidemiological studies 
of HCoV transmission suggest that cross- 
protective immunity is unlikely to be steriliz- 
ing or long-lasting (8), which is also supported 
by repeated reinfection (2, 16). Nevertheless, 
prior immunity induced by one HCoV can re- 
duce the transmission of homologous and 
heterologous HCoVs and ameliorate the symp- 
toms when transmission is not prevented 
(1, 2). A possible modification of COVID-19 
severity by prior HCoV infection may account 
for the age distribution of COVID-19 suscepti- 
bility, in which higher HCoV infection rates 
in children than in adults (4, 6) correlate with 
relative protection from COVID-19 (17) and 
may also shape seasonal and geographical 
patterns of transmission. It is imperative that 
any effect, positive or negative, of preexisting 
HCoV-elicited immunity on the natural course
of SARS-CoV-2 infection be fully delineated. 
The evolution of a tropical biodiversity hotspot


Michael G. Harvey, Gustavo A. Bravo, Santiago Claramunt, Andrés M. Cuervo,
Graham E. Derryberry, Jaqueline Battilana, Glenn F. Seeholzer, Jessica Shearer McKay,
Brian C. O'Meara, Brant C. Faircloth, Scott V. Edwards, Jorge Pérez-Eman, Robert G. Moyle,
Frederick H. Sheldon, Alexandre Aleixo, Brian Tilston Smith, R. Terry Chesser,
Luis Fabio Silveira, Joel Cracraft, Robb T. Brumfield, Elizabeth P. Derryberry


The tropics are the source of most biodiversity yet inadequate sampling obscures answers to fundamental 
questions about how this diversity evolves. We leveraged samples assembled over decades of fieldwork to 
study diversification of the largest tropical bird radiation, the suboscine passerines. Our phylogeny, estimated 
using data from 2389 genomic regions in 1940 individuals of 1287 species, reveals that peak suboscine 
species diversity in the Neotropics is not associated with high recent speciation rates but rather with 
the gradual accumulation of species over time. Paradoxically, the highest speciation rates are in lineages 
from regions with low species diversity, which are generally cold, dry, unstable environments. Our results 
reveal a model in which species are forming faster in environmental extremes but have accumulated in 
moderate environments to form tropical biodiversity hotspots. 


Most biological diversity originated in
tropical regions (1, 2), but long-standing
questions remain about how tropical 
species diversity forms and is main- 
tained (3). Are tropical peaks in diver- 
sity a result of faster rates of evolution or 
greater time for species accumulation (4)? 
When species do form in the tropics, what is 
the primary driver: climatic instability (5), the 
narrow stratification of habitats along eleva- 
tion gradients (6-8), or divergence caused by 
interactions among species (9, 10)? Alterna- 
tively, recent evidence suggests that speciation 
occurs most frequently when few other species 
are present (11). Might this explanation apply 
even in the species-rich tropics? Addressing 
these questions requires detailed investigation 
of the dynamics of speciation and species 
diversity through time, among lineages, and 
across geographic space in tropical groups. 
This work is particularly urgent because the 
same tropical regions that harbor and gener- 
ate biotic diversity are under rapidly intensi- 
fying human pressure (12). 

Through decades of expeditions and re- 
search in the tropics, our knowledge and 
sampling of tropical bird diversity are finally 
at a level of completeness sufficient for a
detailed investigation of tropical diversifica-
tion. Global avian diversity reaches its peak in
the New World tropics, and a large portion of 
that diversity is attributable to a single massive 
radiation: the suboscine passerines (class Aves, 
suborder Tyranni). Roughly one in three 
Neotropical bird species (1250 of 4192 total) is 
a suboscine, even though the group is only 40 
to 51 million years (Ma) old (13-15). Sub- 
oscines have varied phenotypes and ecologies 
and are the predominant avian group in es- 
sentially all terrestrial habitats in the Neo- 
tropics, from the Andean snow line to lowland 
Amazonia and from cloud forests to deserts 
(16). However, a comprehensive species-level 
estimate of suboscine phylogeny has not been 
available, precluding our understanding of 
diversification in this large tropical radiation 
and limiting identification of the broader 
drivers of tropical evolution. 

We assembled a large, complete phyloge-
nomic dataset [table S1 (17)] containing 1940
samples representing 1287 of 1306 suboscine 
species (98.5%) sequenced at 2389 orthologous 
genomic regions. Our maximum-likelihood 
phylogenetic tree (Fig. 1) was highly resolved. 
Relationships across the tree were supported 
by markers throughout the genome, with 
elevated support at sex-linked loci and high
bootstrap support at most nodes. The tree 
topology and branch lengths were highly 
concordant between datasets with minimal 
and extensive filtering of missing data (fig. 
S1). A concatenated maximum-likelihood tree 
and a coalescent-based species tree account- 
ing for heterogeneity in phylogenetic histories 
across the genome recovered highly similar 
relationships [figs. S2 and S3 (17)]. Time 
calibration of the phylogeny using existing 
fossil records within and outside suboscines 
indicates that the extant suboscine radiation 
began diversifying 44.5 Ma ago (range, 42.2 
to 45.7 Ma ago) and individual suboscine 
families originated 18.7 to 36.5 Ma ago (stem 
ages; table S2). This time-calibrated tree pro- 
vided the information needed for estimating 
the dynamics of suboscine diversification across 
time, lineages, and geography and for testing 
links between those dynamics and potential 
drivers of tropical diversity. 

We found that suboscine diversification has 
been relatively steady since the origin of the 
group. Extant suboscine lineages have accu- 
mulated gradually, and our intraspecific sam- 
pling reveals continued accumulation through 
the present (Fig. 1). Diversification-through-time 
analyses did not find evidence for shifts in 
diversification rate over most of the history of 
the group [Fig. 2A (17)] aside from a drop 
within the past 2 Ma likely resulting from 
unsampled intraspecific diversity and unsorted 
ancestral polymorphism. Diversification models 


Fig. 1. Species-level phylogeny of suboscine birds. 
The three infraorders are colored and the 24 families 
are outlined with white polygons. Smaller families 
are numbered as follows: (1) Philepittidae, (2) 
Eurylaimidae, (3) Calyptomenidae, (4) Sapayoidae, 
(5) Pittidae, (6) Onychorhynchidae, (7) Oxyruncidae, 
(8) Pipritidae, (9) Platyrinchidae, (10) Tachurisidae, (11) 
Melanopareiidae, (12) Conopophagidae, (13) 
Formicariidae, and (14) Scleruridae. Nodes that are 
at the crowns of families or older are annotated 
with circles that indicate bootstrap support (see 
gradient scale inset) and error bars depicting 
uncertainty in their ages. Above the phylogeny, a 
time axis and lineage-through-time (LTT) plots are 
depicted. The gray LTT plot depicts currently 
recognized species, and the black area near the top 
depicts the accumulation of additional lineages 
within currently recognized species. The inset in 
the bottom left is a Manhattan plot of gene tree 
support for the consensus topology at loci across 
the genome. Points are colored by chromosome 
(mapped to the zebra finch Taeniopygia guttata), 
alternating gray and black, and chromosomes 

are ordered by size (some smaller chromosomes 
are not labeled). Support is higher at sex-linked 
loci than at autosomal loci (t = -10.3, P < 0.01). 
Representative bird images for each family are 
illustrations reproduced by permission of 

Lynx Edicions. 
Fig. 2. Temporal, taxonomic, and geographic dynamics of suboscine 
diversification. (A) Suboscine speciation is relatively stable over most of the 
group's history, lacking an early burst. (B) Speciation rate varies widely 
among lineages, and five groups are consistently inferred to exhibit shifts 
to elevated speciation rates relative to background rates. Branches are 
colored according to the equal splits speciation rates estimator (ES). Groups 
outlined with polygons share an elevated speciation rate relative to the 
background regime in the best shift configuration from Bayesian analysis of 
macroevolutionary mixtures (BAMM) and are supported by other models with 


branch-specific diversification rates (see the supplementary text). The groups 
are labeled with the name of the most species-rich family contained therein. 
(C) Suboscine species richness is highest in the Neotropics. (D) Recent 
suboscine speciation rates (based on ES) do not peak in Neotropical centers 
of species richness. (E) Areas with high richness have moderate speciation 
rates and moderate diversity age, here measured using average node height. 
To avoid the excessive contribution of wide-ranging species to perceived 
spatial patterns, cell values in the maps of evolutionary statistics (D and E) 
represent averages weighted by the inverse of species range size. 


fit to the suboscine phylogeny as a whole in- 
dicate that constant-rate models fit better than 
models with decreasing diversification rates 
through time [birth-death log likelihood = 
-3626.1; diversity-dependent log likelihood = 
-3658.7; change in corrected Akaike information 
criterion (ΔAICc) = 67.18]. Recent work has 
highlighted challenges associated with identify- 
ing alternative diversification histories in phy- 
logenies of extant taxa (18), but even identifiable 
estimators do not show major shifts through 
time in suboscines (fig. S4). The steady dy- 
namics in suboscines contrast with the pat- 
tern found in many other groups, in which an 
early burst of diversification is followed by a 
gradual slowdown (19). There are multiple 
potential causes for the absence of rate shifts 
(20, 21). For example, tropical clades may have 
been less subject to bursts of speciation asso- 
ciated with episodic ecological opportunity, the 
suboscine radiation may be old enough that 
the signature of an early burst has eroded, or 
suboscines may represent an assemblage of 
smaller radiations with distinct dynamics that 
produce the overall trend [figs. S5 to S7 and 
tables S3 and S4 (17)]. 
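
The model comparison reported above can be sketched numerically. The two log likelihoods are taken from the text; the parameter counts (two for birth-death, three for diversity-dependent) and the sample size `N_OBS` are illustrative assumptions, not values stated in the paper, so the result only approximates the reported ΔAICc of 67.18.

```python
def aicc(log_likelihood: float, n_params: int, n_obs: int) -> float:
    """Corrected Akaike information criterion; lower values indicate better fit."""
    aic = 2 * n_params - 2 * log_likelihood
    # Small-sample correction term added to the plain AIC.
    correction = (2 * n_params * (n_params + 1)) / (n_obs - n_params - 1)
    return aic + correction


# Log likelihoods as reported in the text.
LOGLIK_BIRTH_DEATH = -3626.1
LOGLIK_DIVERSITY_DEPENDENT = -3658.7

# Hypothetical values chosen for illustration only (not given in the text).
K_BIRTH_DEATH = 2          # assumed: speciation rate, extinction rate
K_DIVERSITY_DEPENDENT = 3  # assumed: one extra parameter (carrying capacity)
N_OBS = 1000               # assumed sample size

delta_aicc = (
    aicc(LOGLIK_DIVERSITY_DEPENDENT, K_DIVERSITY_DEPENDENT, N_OBS)
    - aicc(LOGLIK_BIRTH_DEATH, K_BIRTH_DEATH, N_OBS)
)
print(f"Delta AICc (diversity-dependent minus birth-death): {delta_aicc:.2f}")
```

A positive difference of this size means the constant-rate model is strongly preferred, which is the conclusion drawn in the text.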

Recent speciation rates varied >100-fold 
among different evolutionary lineages of sub- 
oscine birds (Fig. 2B). We focused on estimators 
of speciation rates that are most influenced 
by recent branching events and are thus ro- 
bust to assumptions about extinction, but we 
also compared these with other model-based 
estimates (17). Five groups in particular were 
responsible for 77.4% of suboscine diversity 
and were consistently inferred to have ex- 
perienced a shift to higher rates of diversifi- 
cation by a suite of methods [Fig. 2B (17)]. These 
include the lineages containing manakins (Pip- 
ridae), tyrant flycatchers (Tyrannidae), antbirds 
(Thamnophilidae), tapaculos (Rhinocrypti- 
dae), and ovenbirds (Furnariidae), groups that 
are highly distinct from one another in ecol- 
ogy and morphology (16). Future work, how- 
ever, may reveal shared attributes that have 
contributed to elevated diversification across 
these groups. 

When mapped onto geography, the Neo- 
tropical regions with the greatest suboscine 
species richness (Fig. 2C) do not contain 
lineages with the highest recent speciation 
Fig. 3. Examination of the drivers of suboscine speciation rate variation 
reveals the role of low species diversity. (A) Plot of speciation rate (based 
on ES) versus species diversity for all suboscines showing that areas of 

high species diversity contain mostly lineages with low speciation rates. The 
negative speciation rate-diversity association is found both by examining all 
suboscines and by focusing on only those in the Neotropics. (B) The best- 
fitting model from phylogenetic path analysis includes strong associations 
between environmental variables and species diversity and then an impact of 
species diversity on speciation rate. Species diversity is the variable with 
the greatest impact (standardized association deviating from zero) on 
speciation rate (based on ES). The full model slightly outperforms a model in 
which the environment mediates speciation rate entirely through its impact 
on species diversity, and substantially outperforms a model in which 
environment impacts speciation rate directly. 


rates (Fig. 2D) but instead exhibit moderate 
speciation rates and harbor species diversity 
that is older than in surrounding areas (Fig. 
2E). This result is concordant with recent 
latitudinal studies of diversification in wide- 
spread samples of birds, mammals, and fishes 
that have found speciation rates in the tropics 
similar to or lower than those in the Tem- 
perate Zone (22-24). The spatial patterns in 
speciation rate and species age that we ob- 
served were consistent with the results of 
biogeographic modeling of the colonization 
history of suboscine lineages: Old World spe- 
cies were contained within a single, old clade 
of modest diversity, and Nearctic species de- 
rived from 15 to 20 recent (all <10.8 Ma ago) 
colonizations of the Temperate Zone [fig. S8 
(17)]. Moreover, the transition of lineages into 
the Neotropics has only occurred four to nine 
times, which indicates that colonization is 
not a primary driver of elevated Neotropical 
diversity. Although the potential evolutionary 
importance of time has long been recognized 
(25, 26), its contribution in suboscines further 
establishes a key role in the accumulation of 
tropical species. Species-rich regions in the 
tropics are not merely a locus of young di- 
versity but contain diversity accumulated over 
a protracted evolutionary period. 


Harvey et al., Science 370, 1343-1348 (2020) 


We failed to find strong associations between 
speciation rates in the New World and 
environmental variables (table S5). Speciation 
rate was not correlated with elevation (ρ = 
0.187; P = 0.100), nor did species from mountainous 
regions exhibit higher speciation rates 
than those from lowland regions (λ₀ = 0.176 
species/Ma; λ₁ = 0.200; P = 0.933). This result 
is notable in light of evidence of positive as- 
sociations between speciation rates and eleva- 
tion in prior studies (7, 8, 27). Elevation may 
not be a strong and general predictor of spe- 
ciation rate variation. We also found few strong 
associations between speciation rate and cur- 
rent climatic variables such as temperature, 
precipitation, seasonality, or average rates of 
climatic change since the last glacial maximum 
or across five time slices since the Pliocene 
(table S5). 

A key result from our study is that the best 
predictor of elevated speciation rates in New 
World suboscines is low current standing spe- 
cies diversity (Fig. 3). Higher speciation rates 
were observed in lineages with ranges that 
overlapped few other suboscines (ρ = -0.247; 
P = 0.007; Fig. 3A) or few overall bird species 
(ρ = -0.223; P = 0.021). This result was robust 
to phylogenetic uncertainty, incomplete sam- 
pling, simulated random and nonrandom 
extinction, alternative taxonomic classifications, 
and the effects of spatial autocorrelation 
in species richness estimates [table S6 (17)]. 
Moreover, this result was also evident using 
state-dependent speciation and extinction 
models and traditional sister clade compar- 
isons [table S7 and figs. S9 and S10 (17)]. The 
negative speciation-species richness associa- 
tion runs counter to the idea that greater 
potential for species interactions promotes 
speciation in the tropics (8). It instead sup- 
ports recent arguments that the geographic 
dynamics of species formation at a given time 
are a response to rather than a cause of broad 
species diversity patterns (11, 28). In this 
framework, new species form at higher rates 
in areas containing relatively few species or 
where there is a diversity debt relative to the 
environmental capacity for species. 

A model including variable extinction rates 
suggested that areas of low diversity have 
experienced historically high levels of extinc- 
tion (table S7), which may help explain the 
low levels of standing diversity in those areas 
today. This supports prior work suggesting 
that extinction may overwhelm speciation as 
a driver of patterns in species richness (22, 29). 
In suboscines, areas of low diversity were as- 
sociated with environments characterized by 
low temperatures and precipitation, greater 
seasonality in temperature and precipitation, 
and larger changes in climate through geo- 
logic time (table S5). These may be areas in 
which the contemporary habitat is younger 
and lineages are recent colonists from else- 
where. Areas of low species richness and high 
speciation rates included parts of the cen- 
tral and southern Andes and Patagonia, the 
Caribbean, and the Nearctic (Fig. 2, C and 
D). These are also areas found to result in 
low simulated species persistence under recent 
(i.e., the past 800,000 years) paleoclimatic 
models (30). We used phylogenetic path 
analysis to simultaneously evaluate causal 
connections among different environmental 
and biotic variables. The best model contained 
causal links between climatic variables and 
species richness and then a link between spe- 
cies richness and speciation rates (Fig. 3B). 
This result indicates that environmental var- 
iables mediate the number of species in an 
area, which in turn drives speciation rate 
variation. 

By characterizing the dynamics of diversity 
through time, among lineages, and across space 
in a major tropical group, we were able to ad- 
dress long-standing questions about tropical 
evolution. We found that diversity in a tropical 
group has accumulated over a protracted period, 
and that the hotspots of species diversity 
in the tropics are associated with time for 
species accumulation rather than exceptional 
speciation rates in those areas. We provide 
evidence that the environment mediates spe- 
ciation dynamics through this filter of species 
richness. Overall, extreme environments ap- 
pear to limit diversity but provide enhanced 
opportunities for speciation, whereas moder- 
ate environments reduce species forma- 
tion within lineages but permit diversity to 
accumulate. 
PALEOCLIMATE 


Southern Ocean upwelling, Earth’s obliquity, 
and glacial-interglacial atmospheric CO₂ change 


Xuyuan E. Ai*, Anja S. Studer, Daniel M. Sigman, Alfredo Martínez-García, François Fripiat, 
Lena M. Thöle, Elisabeth Michel, Julia Gottschalk, Laura Arnold, Simone Moretti, 
Mareike Schmitt, Sergey Oleynik, Samuel L. Jaccard, Gerald H. Haug 


Previous studies have suggested that during the late Pleistocene ice ages, surface-deep exchange was 
somehow weakened in the Southern Ocean’s Antarctic Zone, which reduced the leakage of deeply 
sequestered carbon dioxide and thus contributed to the lower atmospheric carbon dioxide levels of the 
ice ages. Here, high-resolution diatom-bound nitrogen isotope measurements from the Indian sector of 
the Antarctic Zone reveal three modes of change in Southern Westerly Wind-driven upwelling, each 
affecting atmospheric carbon dioxide. Two modes, related to global climate and the bipolar seesaw, have 
been proposed previously. The third mode—which arises from the meridional temperature gradient as 
affected by Earth’s obliquity (axial tilt)—can explain the lag of atmospheric carbon dioxide behind 
climate during glacial inception and deglaciation. This obliquity-induced lag, in turn, makes carbon 
dioxide a delayed climate amplifier in the late Pleistocene glacial cycles. 


The past ~1 million years of Earth history are 
characterized by large glacial-interglacial 
cycles with an average period of 100,000 
years. The cycles originate from subtle 
variations in Earth’s orbital parameters, 
which affect the seasonal and spatial distribution 
of solar insolation on Earth’s surface (1). 
However, the direct effects of the orbital changes 
on global, annually averaged insolation are 
minor. Thus, feedbacks within Earth’s climate 
system are thought to rectify and amplify the 
insolation changes into large, global climate 
changes. The concentration of the atmospheric 
greenhouse gas CO₂ varied by ~30% over the 
glacial-interglacial cycles (2). Accordingly, it is 
suspected that CO₂ was a key amplifier in the 
generation of these climate cycles; however, 
the causes of the CO₂ variation remain widely 
debated. 

The ocean’s biological carbon pump (the 
production of organic carbon by phytoplankton 
in the surface ocean and its sinking and 
remineralization) sequesters CO₂ in the deep 
ocean, which lowers the concentration of CO₂ 
in the atmosphere. The phytoplankton that 
underpin the biological pump require 
nutrients, namely nitrate and phosphate, which 
are supplied from the ocean interior and are 
returned to it in the sinking organic matter. 
In the Southern Ocean, where phytoplankton 
growth is limited by iron and/or light (3, 4), 
the deeply stored nutrients and CO₂ brought 
to the surface by upwelling and surface-deep 
exchange are not fully consumed. This results 
in high concentrations of residual nutrients 
and the outgassing of CO₂ from the ocean 
to the atmosphere, i.e., a leak in the global 
ocean’s biological pump (5). Different mechanisms 
have been proposed by which the 
Southern Ocean leak in the biological pump 
may have been stemmed during glacial intervals. 
First, increased sea ice cover could have 
lowered atmospheric CO₂ by preventing the 
degassing of CO₂ from underlying surface waters 
(6). Second, iron fertilization could have 
driven more-complete consumption of upwelled 
carbon and nutrients (7). Third, a slowing of 
deep-water ventilation in the Southern Ocean 
could have, in its own right, reduced the region’s 
release of biological pump-sequestered 
CO₂ to the atmosphere while also encouraging 
more-complete nutrient consumption, which 
would have further decreased the CO₂ release (8). 

During the ice ages, higher productivity co- 
incided with high dust supply in the Sub- 
Antarctic Zone (SAZ) of the Southern Ocean 
(9-11), which together offer support for the 
Southern Ocean iron fertilization hypothesis 
(7). However, decreased productivity is found 
in the glacial Antarctic Zone (AZ), the more 
polar domain of the Southern Ocean (10-12). 
Information on surface ocean nutrient condi- 
tions is needed to determine the causes of 
these productivity changes and their implications 
for atmospheric CO₂. 

When phytoplankton grow, they preferentially 
consume ¹⁴N-bearing nitrate (¹⁴NO₃⁻) 
relative to ¹⁵N-nitrate (¹⁵NO₃⁻), such that the 
consumption of nitrate leads to the progressive 
elevation of nitrate δ¹⁵N {δ¹⁵N [‰ (per mil)] = 
$\left[ \dfrac{(^{15}\mathrm{N}/^{14}\mathrm{N})_{\mathrm{sample}}}{(^{15}\mathrm{N}/^{14}\mathrm{N})_{\mathrm{atm\,N_2}}} - 1 \right] \times 1000$}, 
which is in turn reflected in the δ¹⁵N of plankton 
biomass (13). In the SAZ during ice ages, 
an increase in foraminifera-bound δ¹⁵N (δ¹⁵N_fb) 
indicates more-complete nitrate consumption; 
its coincidence with the elevated dust supply and 
greater biological export production strengthens 
the evidence for iron fertilization in the SAZ (14). 
Records of bulk sediment δ¹⁵N and diatom-bound 
δ¹⁵N (δ¹⁵N_db) suggest that surface nitrate 
was also more fully consumed in the AZ 
during glacial periods (15-17). More-complete 
nitrate consumption, in the context of lower 
biogenic fluxes, points to a decline in gross nitrate 
supply to the surface waters of the glacial 
AZ (17). However, the cause of this change is 
still under discussion. 
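
As a minimal sketch of the delta notation defined above, the following converts a measured ¹⁵N/¹⁴N ratio to δ¹⁵N in per mil and back. The reference ratio `R_ATM_N2` is a commonly cited value for atmospheric N₂ and is an assumption here, not a number given in the text; the function names are hypothetical.

```python
# Assumed 15N/14N ratio of atmospheric N2 (commonly cited reference value).
R_ATM_N2 = 0.0036765


def delta15n(r_sample: float, r_reference: float = R_ATM_N2) -> float:
    """Return delta-15N in per mil for a sample 15N/14N ratio."""
    return (r_sample / r_reference - 1.0) * 1000.0


def ratio_from_delta(delta_permil: float, r_reference: float = R_ATM_N2) -> float:
    """Invert the delta notation: recover the 15N/14N ratio from delta-15N."""
    return r_reference * (delta_permil / 1000.0 + 1.0)


# A sample identical to the reference has delta-15N of zero by definition.
print(delta15n(R_ATM_N2))

# A sample enriched in 15N relative to the reference has a positive delta-15N.
enriched_ratio = ratio_from_delta(4.0)
print(delta15n(enriched_ratio))
```

Because phytoplankton preferentially take up ¹⁴N-bearing nitrate, the residual nitrate pool moves toward larger ratios, and therefore higher δ¹⁵N, as consumption proceeds.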

From the Indian sector of the AZ, we report 
two δ¹⁵N_db records with (sub)millennial 
resolution over the last glacial cycle. MD12-3394 
is near the northern edge of the AZ, and 
MD11-3353 is slightly poleward and closer to 
the Kerguelen Islands (Fig. 1). Chronology is 
based on the alignment of organic biomarker 
(TEX₈₆)-based sea surface temperature 
(SST) to the Antarctic temperature stack (ATS) 
(18) or the European Project for Ice Coring in 
Antarctica (EPICA) Dome C temperature (19), 
with seven ¹⁴C ages during the Holocene. The 
marine isotope stage (MIS) 3 interval of MD11-3353 
is tuned by two additional δ¹⁵N_db tie-points 
to MD12-3394 (see supplementary materials). 
The reliability of these chronologies is supported 
by comparison of planktonic δ¹⁸O to 
the ATS (fig. S3). 

During the last ice age, in both MD12-3394 
and MD11-3353, δ¹⁵N_db increased progressively 
from the penultimate interglacial to the Last 
Glacial Maximum (LGM), reaching values that 
are 3 to 4‰ higher than those of the interglacial 
(Fig. 1C). The records agree well with 
the previously published δ¹⁵N_db record of PS75/72-4 
from the Pacific sector of the AZ (15) (Fig. 
1C) and with deep sea coral-bound δ¹⁵N from 
the Drake Passage (20). Together, the datasets 
indicate a high degree of nitrate consumption 
[nitrate concentrations lower than present 
(21)] across a vast expanse of the glacial AZ. 

The δ¹⁵N_db records reveal three mechanistic 
modes of change in AZ nitrate supply, which 
point to changes in the Southern Westerly Winds 
(SWW) as their driver, as described below. In 
turn, the AZ changes can explain key features in 
the ice core record of atmospheric CO₂ and thus 
clarify the role of atmospheric CO₂ in glacial-interglacial 
cycles. We introduce each mode in 
terms of greater nitrate supply and thus lower 
AZ δ¹⁵N_db (which is plotted upward in all figures). 
Mode 1 is driven by global mean climate. The 
maxima in global warmth of MIS 5e, MIS 5c, 
MIS 5a, and MIS 1 (22, 23) (Fig. 2C, pink bars)— 
which occur during 65°N summer insolation 
Fig. 1. High-resolution Antarctic diatom-bound δ¹⁵N records through the last glacial 
cycle. (A) TEX₈₆-based SST of MD12-3394 (dark blue) and MD11-3353 (light blue) 
plotted over the ATS from ice cores (18) (gray). (B) Atmospheric CO₂ (47). (C) δ¹⁵N_db 
of MD12-3394 (dark green), MD11-3353 (grass green), and PS75/72-4 (15) (light green). 
The age models (see text and supplementary materials) are based on correlation to 


maxima (Fig. 2A) (1)—correspond to low δ¹⁵N_db 
and thus high AZ nitrate supply (Fig. 2D). For 
the northern part of the AZ, the region of our 
δ¹⁵N_db records, the import of deep nitrate into 
the surface layer is associated with the upper 
cell of Southern Ocean overturning: the upwell- 
ing and northward transport of water at the 
surface caused by the SWW. In climate models, 
the SWW can respond to global warming with a 
poleward shift, and this may be associated with 
an increase in wind strength (24). Both changes 
lead to stronger wind stress at the latitudes of 
the Drake Passage, which in turn strengthens 
wind-driven northward surface transport and 
upwelling in the AZ (25). The wind-driven 
northward transport in the AZ is counteracted 
by southward eddy advection, such that the 
net flow of the upper cell is consistent with air- 
sea buoyancy fluxes. Nevertheless, the eddy 
compensation is partial (26), with models in- 
dicating that a change in the SWW-driven 
northward transport would cause a change 
in net AZ upwelling of the same sign (27). 
For the northern part of the AZ, where our 
δ¹⁵N_db records were recovered, the observed 
amplitude of δ¹⁵N_db elevation during the 
glacial maxima requires a >75% reduction 
in nitrate supply (27). Such a large change 
favors the interpretation that a weakening 
and/or equatorward displacement of the SWW 
caused upwelling to decline in the glacial 
AZ (28). 

Mode 2 is driven by the bipolar seesaw. 
During MIS 3, δ¹⁵N_db in MD12-3394 covaries 
with the millennial-scale Antarctic isotope 
maximum (AIM) events (Fig. 1, D to F, and 
Fig. 2, blue bars). Lower δ¹⁵N_db is associated 
with higher Antarctic air temperature, which 
suggests that during the associated Heinrich 
events, while the Southern Ocean warms (29, 30), 
AZ surface nitrate supply rises. Our data are 
consistent with the interpretation that a re- 
duction of North Atlantic overturning during 
Heinrich events caused a southward shift and 
strengthening of the SWW (31), which led to 
increased AZ upwelling (32). 

For AZ upwelling to become stronger, both 
modes described above involve the warming 
of the Southern Ocean, which should be re- 
flected in Antarctica. If the combination of the 
two modes is sufficient to explain the changes 
in the strength and position of the SWW and 
thus the changes in upwelling strength, then 
nutrient conditions in the AZ should be closely 
coupled with Antarctic ice core temperatures. 
We calculate the predicted δ¹⁵N_db under the 
assumption that δ¹⁵N_db varies linearly with 
the ATS (Fig. 2D) [(18); see materials and 
Antarctic ice core temperature and are thus not fully independent. (D, E, and F) Expansion 
of MD12-3394 SST (dark blue), ATS (gray), CO₂ (black), and MD12-3394 δ¹⁵N_db (dark 
green) from 20,000 to 70,000 years ago. (G) Map of austral summer surface nitrate 
concentration (48) and the Antarctic Polar Front (49) (white line) with core sites indicated. 
ka, thousand years ago. The marine isotope stages are inferred from the ATS. 


methods]. The difference in the measured 
δ¹⁵N_db from this predicted δ¹⁵N_db record is 
quantified as the δ¹⁵N_db offset (Fig. 2G). The 
resulting δ¹⁵N_db offset record leads us to propose 
a third mode of δ¹⁵N_db change driven by 
changes in obliquity, as explained next. 
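
The offset calculation described above (measured minus predicted, with the prediction assumed linear in Antarctic temperature) can be sketched as follows. This uses an ordinary least-squares fit as one plausible way to obtain the linear prediction; the scaling actually used is described in the paper's materials and methods and may differ. The helper names and the numbers in the example are hypothetical, made up purely for illustration.

```python
def linear_fit(x, y):
    """Ordinary least-squares slope and intercept for y as a function of x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def offset_record(temperature, measured):
    """Measured minus predicted values, where the prediction is linear in temperature."""
    slope, intercept = linear_fit(temperature, measured)
    predicted = [slope * t + intercept for t in temperature]
    return [m - p for m, p in zip(measured, predicted)]


# Synthetic, made-up values (NOT data from the paper):
# temperature anomalies (degrees C) and diatom-bound d15N (per mil).
temperature_anomaly = [-8.0, -6.0, -4.0, -2.0, 0.0]
d15n_db_measured = [7.0, 6.2, 4.0, 3.6, 3.0]

offsets = offset_record(temperature_anomaly, d15n_db_measured)
for t, off in zip(temperature_anomaly, offsets):
    # Negative offset: lower d15N (more nitrate supply) than temperature alone predicts.
    print(f"T anomaly {t:5.1f}  offset {off:+.2f}")
```

In this framing, a negative offset marks an interval in which upwelling was stronger than Antarctic temperature alone would predict, which is the signal the text attributes to obliquity.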

Independent of the absolute temperature 
at high latitudes, ocean-only climate models 
indicate that a steeper temperature gradient 
between the middle and high latitudes results 
in a strengthening of the SWW (24). Low ob- 
liquity increases the insolation gradient, lead- 
ing to a stronger temperature gradient between 
the middle and high latitudes (33, 34), which 
appears to be reflected in the temperature dif- 
ference between the moisture source region 
and ice core site as estimated from deuterium 
excess in the Antarctic ice cores (35, 36) (Fig. 2F). 
In turn, a stronger temperature gradient yields 
stronger SWW, which enhance AZ upwelling 
(34). Consistent with the operation of a third, 
obliquity-driven mode of AZ upwelling change, 
the δ¹⁵N_db offset covaries with obliquity and 
with the temperature difference between moisture 
source and site in Antarctic ice cores: Negative 
δ¹⁵N_db offsets (higher nitrate supply than 
predicted) are associated with obliquity min- 
ima and larger temperature differences (Fig. 2, 
E to G, and fig. S6). 
Fig. 2. Evidence for three modes of change in SWW-driven upwelling in the AZ. 
(A) Daily mean top-of-atmosphere insolation at 65°N summer solstice (50). (B) ATS 
(15). (C) Two reconstructions of global SST (22, 23). (D) δ¹⁵N_db from MD12-3394 
(light green circles), the mean of δ¹⁵N_db from MD12-3394 and MD11-3353 calculated 
using a Monte Carlo and Kalman filter method (20) with a resolution of 500 years 
(dark green line; fig. S1), and the predicted δ¹⁵N_db if it strictly follows ATS (dotted 
line; see materials and methods). (E) Obliquity, with solid or hatched orange 
shading indicating lower or higher than the average of the past 150,000 years, 
respectively (51). (F) Changes in the difference between the air temperature at the 
moisture source (T_source) and the ice core site (T_site) of Vostok (35) (dark gray) 
and Dome Fuji (36) (light gray) relative to modern conditions, reconstructed from 
ice deuterium excess. Solid or hatched orange shading indicates higher or lower 
T_source − T_site than the average of the past 150,000 years at Vostok, 
respectively. (G) δ¹⁵N_db offset (measured minus predicted δ¹⁵N_db; see materials 
and methods) calculated using the δ¹⁵N_db of MD12-3394 (gray line) and the mean 
δ¹⁵N_db from the two records (black line). Solid or hatched orange shading indicates 
a negative or positive δ¹⁵N_db offset, respectively, which implies greater than or 
less than predicted AZ upwelling, respectively. Illustration at the bottom shows 
the Westerly Wind belts and their changes under the three mechanistic modes. For 
modes 1 (pink) and 2 (blue), colors correspond to the shaded intervals in the plot, 
all shown for an increase in AZ upwelling (i.e., a decrease in δ¹⁵N_db). For mode 3, 
in the illustration and (G), solid orange or hatching, respectively, symbolizes stronger 
or weaker upwelling due to stronger or weaker meridional temperature gradient and 
thus stronger or weaker SWW. 


The strongest negative δ¹⁵N_db offset occurs 
at the end of the penultimate interglacial 
through the glacial inception at the MIS 5e- 
MIS 5d transition. From 125,000 to 115,000 
years ago, global average SST declines by ~1°C 
(Fig. 2C) and extratropical SST declines by 
~1.5°C (37), which should lead to less AZ 
upwelling by mode 1. In the high latitudes, 
the average SSTs of both 50 to 60°S and 50 to 
60°N drop by 2 to 2.5°C (22); thus, there is no 
sign of a hemispheric asymmetry that might 
shift the SWW poleward and maintain AZ upwelling 
by mode 2. Yet, δ¹⁵N_db remains at low 
interglacial values or even decreases slightly 
from 125,000 to 115,000 years ago, which indicates 
sustained AZ upwelling. Similarly, in 
MIS 5d, both Antarctic ice core temperatures 
and North Greenland Ice Core Project (NGRIP) 
temperature as well as AZ SST drop to values 
comparable to those observed in the LGM 
(38, 39), and reconstructed global average 
SST shows an ~50% cooling toward the LGM 
level (22, 23) (Fig. 2, B and C). However, δ¹⁵N_db 
only increases ~30 to 35% toward its full 
glacial value (Fig. 2D). The maintenance of 
low δ¹⁵N_db from 125,000 to 115,000 years 
ago and the relatively small δ¹⁵N_db increase 
across MIS 5d (Fig. 2, D and G) suggest that 
SWW-driven upwelling during these intervals 
is stronger than the scenario in which AZ 
upwelling declines in lockstep with Antarctic 
cooling. 

This stronger-than-expected upwelling can 
be explained by mode 3. Low obliquity (Fig. 2), 
which steepens the meridional temperature 
gradient (Fig. 2F) and strengthens the SWW, 
appears to have counteracted the effect of global 
cooling on SWW-driven upwelling. Similar 
to the end of the penultimate interglacial, the 
current interglacial (the Holocene) is charac- 
terized by declining obliquity (Fig. 2E). Largely 
because of this trend, climate models simu- 
late a SWW-driven increase in upwelling since 
7000 years ago (40). Consistent with this, δ¹⁵N_db 
and the δ¹⁵N_db offset decline over the Holocene 
(41) (Fig. 1C and Fig. 2G), which affirms the 
proposed role of obliquity (i.e., mode 3) in 
Southern Ocean upwelling. 

Due to mode 3, high obliquity often corresponds 
to intervals of positive δ¹⁵N_db offset, 
especially during deglaciations. The deglacial 
warming of the Antarctic region due to modes 
1 and 2 tends to cause the SWW to shift pole- 
ward and strengthen in the AZ. However, at 
the same time, high obliquity (mode 3) works 
against modes 1 and 2, weakening the SWW 
and depressing upwelling in the AZ, which 
reduces the decline in δ¹⁵N_db and thus increases 
the δ¹⁵N_db offset (Fig. 2, D and G), while it 
decreases the temperature difference between 
moisture source and ice core site (Fig. 2, E and 
F). Modes 1 and 2 appear to dominate the change 
in the SWW and thus drive stronger AZ up- 
welling during the deglaciations. Nevertheless, 
the counteracting effect of mode 3 is reflected 
in the late decline in δ¹⁵N_db at termination 2 
and the weak decline in δ¹⁵N_db compared with 
the rise in Antarctic air temperature at termination 1 
(Fig. 2D and fig. S8). Finally, other intervals 
during which changes in δ¹⁵N_db deviate 
from those in Antarctic temperature, such as 
MIS 4 with its relatively low δ¹⁵N_db and the 
interval of relatively high δ¹⁵N_db at 48,000 to 
57,000 years ago during MIS 3, can also be 
explained by mode 3 (Fig. 2, E to G, and the 
supplementary materials). 

The AZ data, in combination with proxies 
from the SAZ, show that changes in Southern 
Ocean surface nutrient conditions can explain 
much of the ice age atmospheric CO₂ drawdown, 
with the AZ and SAZ having complementary 
correspondences with the CO₂ declines 
of the last glacial progression. During MIS 5, 
CO₂ is more closely correlated with AZ δ¹⁵N_db 
than with Antarctic air temperature (Fig. 3, 
A, B, and I, and fig. S4). This is particularly 
apparent for the glacial inception (108,000 to 
124,000 years ago; Fig. 3, F to I). The similar 
timing and proportionality of change in δ¹⁵N_db 
and CO₂ at the MIS 5e-MIS 5d transition suggest 
that weaker upwelling and reduced CO₂ 
venting in the AZ play an important role in the 
Fig. 3. Role of the AZ in atmospheric CO₂ change. (A) ATS (gray) (18) and 
atmospheric (atm.) CO₂ (black) (47). (B) Indian AZ mean δ¹⁵N_db (dark green 
solid line), MD12-3394 δ¹⁵N_db (light green circles), and the predicted δ¹⁵N_db 
(dotted line). (C) δ¹⁵N_fb (blue circles) and iron flux (gray line) at ODP Site 1090 in 
the SAZ (14). (D) δ¹⁵N_db offset (measured minus predicted δ¹⁵N_db; see materials 
and methods) calculated using MD12-3394 δ¹⁵N_db (gray circles) and the Indian 
AZ mean δ¹⁵N_db (black line). (E) CO₂ offset from a linear regression between 
the atmospheric CO₂ composite and ATS (see materials and methods). The 
shaded intervals of (A) to (E) correspond to 72,000 to 124,000 years ago (gray) 
and 20,000 to 72,000 years ago (purple), respectively. (F, G, and H) An expansion 
of 95,000 to 125,000 years ago, covering the last glacial inception and including 
the three δ¹⁵N_db records from Fig. 1. In (F), EPICA Dome C sea salt-Na flux (blue) 
(42) is plotted over ATS (dark gray) and EPICA Dome C air temperature (light 
gray) (38). The gray shading corresponds to the interval of CO₂ decline at the MIS 
5e-MIS 5d transition. The hashed bar at the top indicates the glacial inception of 
108,000 to 124,000 years ago. (I) Cross plot of CO₂ versus Indian AZ mean δ¹⁵N_db. 
Colors follow shaded intervals in (A) to (E). Connected markers are data for the 
glacial inception, indicated by the hashed bar in (F) to (H). Gray line is the trend 
fitted through the data of the interglacials (0 to 10,000 years ago and 115,000 to 
126,000 years ago) and glacial maxima (20,000 to 27,000 years ago and 140,000 to 
155,000 years ago). Deglacial intervals (126,000 to 140,000 years ago and 10,000 to 
20,000 years ago) are excluded from the cross plot to focus on ice age inception 
and development (see fig. S8). 


drawdown of CO, during the inception and 
early development of the last ice age. It has 
been proposed that sea ice drives the initial 
CO, decline at the MIS 5e-MIS 5d transition 
by acting as a barrier to ocean-atmosphere 
gas exchange (22). However, EPICA Dome C 
sea salt-Na flux, a proxy of AZ sea ice extent, 
indicates that the expansion of sea ice at the 
end of MIS 5e is in step with Antarctic air 
temperature and predates the CO, decline 
(Fig. 3F) (42). 

The correlation of CO2 with AZ δ15Ndb weakens after the MIS 5-MIS 4
transition. In particular, from 72,000 to 68,000 years ago, CO2 declines
sharply from 240 to 200 parts per million (ppm) even though AZ δ15Ndb
remains between 4 and 5‰ (Fig. 3, A and B). The sharp drop in CO2 may be
caused by dust-borne iron fertilization in the SAZ: At this time in Ocean
Drilling Program (ODP) Site 1090, δ15N, iron flux, and biogenic fluxes all
record sharp increases (14) (Fig. 3C and fig. S5). During MIS 3 through the
LGM, both AZ δ15Ndb and SAZ δ15N exhibit millennial-scale variations that
can explain CO2 oscillations during that time interval (14) (Fig. 1F and
Fig. 3, B and C). This suggests that the surface biogeochemical changes of
the AZ and SAZ combine to explain much of the CO2 variation during the
second half of the ice age, including the AIM events and the deglacial CO2
increase (20).

The effect of obliquity-driven mode 3 on Antarctic upwelling has
substantial consequences for atmospheric CO2. For comparison with the AZ
δ15Ndb offset record, an analogous CO2 offset record [as in (36)] is
generated from the residuals of the linear correlation between CO2 and
Antarctic air temperature (Fig. 3E and materials and methods). The δ15Ndb
and CO2 offset records have similar features both between 0 and 20,000
years ago (the last deglaciation and the Holocene) and between 95,000 and
140,000 years ago (the penultimate deglaciation and early glacial
progression of the last ice age) (Fig. 3, D and E). These features are
consistent with mode 3 causing delays in the AZ contributions to both the
deglacial CO2 rise and the CO2 decline associated with glacial inception
and progression. Similarly, the parallel changes in the δ15Ndb and CO2
offset records suggest that the 20-ppm increase in CO2 during the late
Holocene was caused by a strengthening of AZ upwelling (Fig. 3, A, B, D,
and E) (47) due to falling obliquity (40) (Fig. 2E).
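
The offset calculation described above (residuals of a linear fit between CO2 and Antarctic air temperature) can be illustrated with a minimal sketch. The function name `offset_from_regression` and the five sample values are hypothetical and invented for illustration only; they are not data or code from the study, and the study's materials and methods may include steps (such as interpolation onto a common age scale) that are not shown here.

```python
import numpy as np


def offset_from_regression(temperature, co2):
    """Return residuals of an ordinary least-squares line co2 ~ a*temperature + b.

    A positive residual means CO2 is higher than the linear fit predicts
    for that temperature; a negative residual means it is lower.
    """
    temperature = np.asarray(temperature, dtype=float)
    co2 = np.asarray(co2, dtype=float)
    # Degree-1 polynomial fit returns (slope, intercept)
    slope, intercept = np.polyfit(temperature, co2, 1)
    predicted = slope * temperature + intercept
    return co2 - predicted


# Hypothetical values for illustration only (not measurements from the paper)
temperature_anomaly = [-8.0, -6.0, -4.0, -2.0, 0.0]  # degrees C
co2_ppm = [190.0, 215.0, 225.0, 260.0, 280.0]        # parts per million

offset = offset_from_regression(temperature_anomaly, co2_ppm)
print(np.round(offset, 2))
```

Under this definition, an interval in which CO2 stays high while temperature falls shows up as a run of positive offsets, which is the kind of lag the text attributes to mode 3.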

Atmospheric CO2 change broadly works to amplify the glacial cycles. Our
data point to the response of SWW-driven AZ upwelling to global climate
(i.e., mode 1), along with SAZ iron fertilization, as central to this
amplification. Moreover, the response of AZ upwelling to the bipolar seesaw
(i.e., mode 2) causes the AZ to contribute to the CO2 rises during
deglaciations (32). However, CO2 change appears to lag behind other
components of the forcing-feedback system of the glacial cycles (36), with
notable consequences. For example, extension of the period of strong
greenhouse effect through the end of the penultimate interglacial and the
beginning of the last ice age slows the global cooling, which contributes
to the sawtooth pattern of the last glacial cycle (43). Similarly,
gradually rising atmospheric CO2 has been implicated in the persistent
warmth of the Holocene, despite the orbital cues for preglacial cooling
(44, 45). If the glacial and deglacial lags in CO2 change are owed to the
effect of obliquity on AZ upwelling (i.e., mode 3), then this partly
counteracts the long-recognized capacity of obliquity to affect global
climate through the extent of polar ice (46). This counteraction may cause
obliquity to cede control of glacial-interglacial transitions to precession
and eccentricity, thus explaining the importance of the 20,000-year and
100,000-year periods in the glacial cycles of the past ~1 million years (1).


Cell Culture Analyzer

Thrive Bioscience introduces CellAssist, 
which enables cell culture researchers to 
image, analyze, and document all cells, 
plates, reagents, and workflow details in a 
centralized database. CellAssist raises the 
bar for adherent cell culture by imaging and 
analyzing all the cells in a plate, not just cells 
chosen during typical manual inspection. 
Minutes after inserting a standard multiwell 
plate, CellAssist captures, analyzes, and displays hundreds of 
high-resolution images of cells at multiple magnifications, using 
phase-contrast or brightfield imaging. The visualization and analy- 
sis software allow researchers to comprehensively review images 
and metrics of the entire history of a project from their office or 
laboratory. 

Thrive Bioscience 

For info: 978-720-8044 

www.thrivebiocellassist.com


Exosome Purification Kit

The Exo-spin 96 exosome purification kit from Cell Guidance Systems 
enables researchers to purify exosome samples in the standard 
96-well plate format. The kit is based on the increasingly popular 
size-exclusion chromatography (SEC) method of exosome purifica- 
tion now used by most exosome research labs. Exo-spin 96 and 
Exo-spin mini columns have identical column dimensions and resin- 
bed length, allowing scale up from small to large sample numbers 
using the same protocol, which takes less than 1 h to complete
and allows automation in the standard 96-well format. Advanced
manufacturing techniques and stringent quality control systems 
used in the production of Exo-spin 96 kits ensure sample-to-sample 
consistency. 

Cell Guidance Systems 

For info: +44-(0)-1223-967316 

cellgs.com/products/exo-spin-96.html 


Durable Mirrors for High-Power Lasers 

Optical Surfaces produces extremely durable, high-performance 
coated mirrors up to 600 mm in diameter for use with high-power 
femtosecond lasers. Employing the latest dielectric coating technol- 
ogy, these mirrors provide minimum pulse distortion and maxi- 
mum usable bandwidth. When used with high-power femtosecond 
lasers operating at 750 nm to 850 nm, Optical Surfaces mirrors are 
designed for use at power densities exceeding 1 J/cm² for high-frequency
50-fs pulses without deterioration. Produced from materials including
glass, ceramic, and silica, Optical Surfaces off-axis
parabolic mirrors are designed to optimize the performance of 
rapid-pulsed, high-power lasers. All surface accuracies are checked 
by interferometer and are guaranteed to meet lambda/10 peak- 
to-valley (p-v) wave accuracy with low scatter. Typical applications 
for these extremely durable mirrors include plasma physics, laser 
manufacturing, and telecommunications. 

Optical Surfaces 

For info: +44-(0)-208-668-6126 

www.optisurf.com


Bioactive Proteins

Abcam’s new premium-grade bioactive proteins are highly active 
growth factors and cytokines designed to meet the specific needs 
of cell culture, including cell and gene therapy and regenerative 
medicine. Sourcing high-quality proteins with batch-to-batch con- 
sistency can be a challenge, as endotoxin levels and/or nonnative 
protein conformations and posttranslational modifications can affect 
safety and performance. These proteins are manufactured in mam- 
malian cell lines under animal-free conditions, limiting endotoxins 
to ultralow levels [<5 endotoxin units (EU)/mg]. The use of mam- 
malian cell lines produces proteins with the desired conformation 
and correct posttranslational modifications for optimal bioactivity. 
They resemble the proteins’ native state, helping to ensure a safe, 
consistent outcome during drug discovery, development, and manu- 
facturing. Developed by a team of industry-expert protein scientists, 
each batch is validated using state-of-the-art technology to ensure 
integrity, purity, concentration, monodispersity, and correct folding. 
Abcam can also design, manufacture, and validate proteins to cus- 
tom needs and specifications. 

Abcam 

For info: 888-772-2226 

www.abcam.com 


Eight-Position Parallel Reaction Station 

The DrySyn OCTO Mk II from Asynt is an eight-position reaction
station that provides chemists with powerful magnetic stirring,
heating, inert atmosphere, and reflux, all within a compact footprint.
OCTO Mk II is an ideal tool for chemists performing syntheses and
screening reactions in parallel. Accommodating individual reaction 
volumes of up to 6 mL, this reaction station has been designed to 
use standard low-cost reaction tubes to minimize your expenditure 
on consumable items. To further increase your parallel-synthesis 
productivity, up to three stations can be used together on a single 
standard magnetic hotplate stirrer, enabling you to perform up to 24 
simultaneous reactions. 

Asynt 

For info: +44-(0)-1638-781709 

www.asynt.com 


Quantitative ELISA to Measure SARS-CoV-2 Antibodies 
EUROIMMUN, a PerkinElmer company, has launched the Anti-SARS- 
CoV-2 QuantiVac ELISA (IgG) to quantify immunoglobulin G (IgG) an- 
tibodies against the SARS-CoV-2 S1 antigen. This quantitative test kit 
applies a recombinant S1 subunit of the SARS-CoV-2 spike protein, 
enabling detection of IgG antibodies, which have been the focus of 
vaccine development programs as well as discussions about possible 
immunity in previously infected individuals. This new ELISA assay 
supports careful evaluation of an important antibody subgroup,
enabling standard curve-based quantification of the anti-S1 IgG antibody
concentration. EUROIMMUN has increased its ELISA production
for COVID-19 diagnostics to more than 3 million tests per week and 
can supply large numbers of dried-blood spot collection sets for up- 
coming seroprevalence and vaccination development studies. Large 
international reference laboratories have implemented EUROIMMUN 
SARS-CoV-2 assays, and the tests are being applied in an increasing 
number of epidemiological studies worldwide. 

PerkinElmer 

For info: 800-762-4000 

www.perkinelmer.com 


Electronically submit your new product description or product literature information! Go to www.sciencemag.org/about/new-products-section for more information. 


Newly offered instrumentation, apparatus, and laboratory materials of interest to researchers in all disciplines in academic, industrial, and governmental organizations are featured in this
space. Emphasis is given to purpose, chief characteristics, and availability of products and materials. Endorsement by Science or AAAS of any products or materials mentioned is not
implied. Additional information may be obtained from the manufacturer or supplier.


Prof. Huamin Yang
President of Changchun University of 
Science and Technology 


“As the cradle of Chinese optical talents, CUST respects talent, creates
a friendly environment for it, and offers spacious platforms, all-round
resources, and financial support for talent cultivation. We sincerely
welcome outstanding talents from home and abroad to work here and achieve
their academic ambitions.”


Changchun University of Science and Technology (CUST), situated by the
scenic South Lake of Changchun, Jilin Province, was originally known as
the Changchun Institute of Optics and Fine Mechanics, which was founded by
the Chinese Academy of Sciences in 1958. Professor Wang Daheng, an
academician of the Chinese Academy of Sciences and the Chinese Academy of
Engineering, was the first president. Professor Wang’s motto of “being
loyal and devoted, knowledgeable and professional, truth-seeking and
innovative, modest and open-minded” and the school motto “Ambition,
Perseverance, Practicality and Excellence” have exerted a profound and
lasting influence on the development of the university during the last six
decades.

Widely acknowledged as “the Cradle for Chinese Optical Talents” and a
member of the “National Basic Ability Construction Project of Western and
Central China”, CUST has cultivated over 100,000 graduates, among whom are
academic elites, successful entrepreneurs, and talents in all walks of life
who contribute to the development of the country, the revitalization of
science and technology, and the advance of the nation.

The university aims to cultivate innovative, versatile, and practical
talents and to build a comprehensive education system for undergraduates
and graduates, with optoelectronic technology as its outstanding feature
and optics, mechanics, electronics, computer science, and materials science
as its priorities. The university coordinates the development of
technology, science, arts, economics, management, and law.

CUST actively conducts scientific research, with fruitful achievements in
lasers, optoelectronic instruments, advanced manufacturing, computer
applications, artificial intelligence, optoelectronic functional materials,
nanomaterials, and biomedical engineering.

ADVERTISEMENT

The university attaches great importance to international communication
and has established substantial cooperation with more than 90 universities
and research institutes from over 20 countries, including the United
States, England, Germany, and Russia. More and more foreign scholars and
students come to the university to conduct research or study, and
international conferences are held here annually.

To promote the cultivation of talents, CUST carries out the “Optical
Talents Project” and has launched a talent cultivation and recruitment
program for integrated development and prosperity.

Facing the opportunities presented by the construction of World First
Class Universities and Disciplines, CUST will spare no effort to become a
top university in China with a prestigious reputation all over the world.

We sincerely welcome excellent talents to join CUST.


Tel: +86-431-8558-2300 
Fax: +86-431-8538-3815 
E-mail: zhaopin@cust.edu.cn 


Address:
No. 7089, Weixing Road (East Campus),
No. 7186, Weixing Road (South Campus),
No. 7989, Weixing Road (West Campus),
Changchun, Jilin Province, P. R. China
Post code: 130022


JIANGSU NORMAL UNIVERSITY

Jiangsu Normal University (JSNU) is one of the top universities in Jiangsu
Province, jointly sponsored and administered by China’s Ministry of
Education and the People’s Government of Jiangsu Province. Its origins date
back to 1952 in Wuxi City, Jiangsu Province. In 1958 it moved north to
Xuzhou, and the following year it was founded as Xuzhou Teachers’ College;
it was renamed Xuzhou Normal University in 1996 and has carried its present
name, Jiangsu Normal University, since 2011. In recent years, by giving
priority to building a high-level university and enriching its academic
substance, JSNU has grown rapidly in many fields. In the 2021 US News
Rankings, JSNU ranks 1056th among the best global universities and 102nd
among universities in mainland China. It ranks 116th in the 2020 Chinese
University Evaluation Rankings directed by Wu Shulian and his team.

JSNU comprises 22 colleges, CW CHU College, the School of Continuing
Education, the International College, and “The Joint Engineering Institute
of Peter the Great St. Petersburg Polytechnic University and Jiangsu Normal
University” (JSNU-SPbPU Institute of Engineering), which was the first
Sino-Russian jointly run institute without independent legal entity status
approved by the Ministry of Education, as well as an independent college,
Kewen College. JSNU now offers 59 undergraduate programs, 35 master’s
programs in first-level disciplines, 13 professional master’s programs, and
1 doctoral program targeted at China’s special demands, covering 11
categories of disciplines. JSNU has 1,657 full-time faculty members,
including 337 full professors and 584 associate professors. Of these, 61.3%
hold a doctorate, and 43% have gone abroad for academic or research
purposes. The university has 42 top-level national talents, including
research fellows of the Chinese Academy of Sciences and the Chinese Academy
of Engineering, as well as “Cheungkong Scholars”. It enrolls 19,300
full-time undergraduates, over 3,900 postgraduates and doctoral candidates,
and over 440 international students in degree programs.

At present, six of its disciplines are rated as provincial prestigious
disciplines and 12 as provincial key disciplines. Its chemistry and
engineering are listed among the top 1% of ESI. The program of Chinese
language and literature has blazed a trail in the field of neurolinguistics,
with striking interdisciplinary features spanning arts, science, and
engineering. This program has won the titles of national first-class
program, national characteristic program, national teaching team, and
national teaching demonstration center, and it is globally influential. The
statistics program has been granted one national key project and three
excellent youth projects in recent years; it leads domestic research on
reliability theory and stochastic computing and has made remarkable
achievements in the national prevention of financial risks and the building
of the Smart Jiangsu Initiatives. The biology program, entrusted by the
state for special demands, has contributed a great deal to the national
sweet potato industrial technology system. JSNU has established one “2011”
collaborative innovation center, jointly sponsored by Jiangsu Province and
the Ministry of Education. In the past five years, its faculty have been
awarded 2 first prizes for National Teaching Achievements, 13 national key
scientific research projects, and 7 Outstanding Achievement Awards of the
Ministry of Education. Its students have won 7 awards for special
contribution and 6 gold awards in national college student competitions,
including “Internet +”, “The Challenge Cup”, and others.

JSNU has been honored as a “National Model Campus” in the first group of
winners and as a “National Model Organization” in the third group of
winners. The university has been commended as a “Jiangsu Model
Organization” ten consecutive times, a “Jiangsu Model Campus” five times,
and a “Jiangsu Pacesetter Establishment” five times by the CPC Jiangsu
Committee and the Provincial People’s Government. It has also been awarded
“National Model Collective of National Solidarity and Progress” and other
honors.


QUALIFICATIONS

1. Abide by national laws and regulations; devoted to Chinese higher education.
2. Have a Ph.D. degree or equivalent plus postdoctoral experience.
3. Excellent record of research accomplishments.


APPLICATION

Please send a scanned copy of your resume and relevant supporting
documents to rsk@jsnu.edu.cn.


CONTACT

E-mail: rsk@jsnu.edu.cn
Tel: (86516) 83500204
Address: No.101, Shanghai Road, Tongshan District, Xuzhou, Jiangsu Province, P.R. China 221116

Follow us on Wechat


ADVERTISEMENT

online @sciencecareers.org


Nanjing Forestry University (NFU) was founded in 1902 with the merger of
the Forest Departments of Central University and of Jinling University.
Located in the east of Nanjing, at the foot of beautiful Purple Mountain
and east of Xuanwu Lake, NFU is a comprehensive university administered
directly by the Jiangsu Provincial Department of Education together with
the Central Government. NFU is now listed as one of the nation’s top
universities under China’s Double First-class Initiative in Universities
and Disciplines and is jointly built by the Ministry of Education of China
and the Jiangsu People's Government.

Nanjing Forestry University is seeking top scholars and talents from
around the world to build a better tomorrow for NFU.


Eligibility Criteria

Metasequoia Scholar - Category C
(1) Willing to contribute to Chinese higher education; committed to
following Chinese law and professional ethics.
(2) PhD degree; a minimum of three years of postdoctoral research at a
prestigious overseas university or research institute, or a teaching or
research position obtained at a prestigious overseas university, research
institute, or corporate research department.

Metasequoia Scholar - Category D
(1) Willing to contribute to Chinese higher education; committed to
following Chinese law and professional ethics.
(2) PhD degree; a minimum of two years of postdoctoral research at a
prestigious overseas university or research institute. The postdoctoral
research requirement and its duration can be discussed for those who have
obtained a PhD degree from a prestigious overseas university and made
significant academic achievements.


Supporting Policy

Category C
(1) Position: the qualified candidate will be offered a professorship and
included in the public institutional system.
(2) Compensation: annual salary of RMB 600,000 for the first three years.
(3) Housing support: In addition to a temporary apartment on campus
provided for three years, a resettlement allowance of RMB 200,000 and a
housing subsidy of RMB 1.5 million will be offered (excluding matching
subsidy from local government).
(4) Research fund: A research fund of RMB 2 million will be provided
(excluding matching fund from local government).
(5) Academic support: Sufficient space for office and lab; prioritized
support for research team setup and master/PhD student enrollment.
(6) Assistance with children's school enrollment and the spouse's
employment.
(7) Prioritized support in applying for the national young talent program
and provincial talent program.

Category D
(1) Position: the qualified candidate will be offered a professorship and
included in the public institutional system.
(2) Compensation: annual salary of RMB 300,000 for the first three years.
(3) Housing support: In addition to a temporary apartment(100 on campus
provided for three years, a resettlement allowance of RMB 200,000 and a
housing subsidy of RMB 800,000 will be offered (excluding matching subsidy
from local government).
(4) Research fund: A research fund of RMB 1 million will be provided
(excluding matching fund from local government).
(5) Academic support: Sufficient space for office and lab; prioritized
support for research team setup and master/PhD student enrollment.
(6) Assistance with children's school enrollment and the spouse's
employment.
(7) Prioritized support in applying for provincial talent projects.


Contact Person from HR
For Chinese: Ms. Qian      Tel: 86-25-85420686   Email: gr-b@njfu.edu.cn
For foreigners: Ms. Tang   Tel: 86-25-85427310   Email: tzx@njfu.edu.cn


ADVERTISEMENT


Disciplines open and
contact information


Tenure-track Positions at The HIT Center for Life Sciences

The HIT Center for Life Sciences (HCLS) was founded in 2016. As the first
special academic zone of Harbin Institute of Technology (HIT), HCLS is
empowered by unparalleled freedom in research, graduate programs, personnel
employment, and financial flexibility within the university. According to
the 2019, 2020, and 2021 US News Rankings of Best Global Universities,
HIT’s Biology and Biochemistry was ranked in the top ten in China.

HCLS is seeking outstanding PIs from all over the world. HCLS offers an
internationally competitive start-up package and a multidisciplinary,
innovative environment. The positions are open to all biological
disciplines in Neuroscience, Microbiology and Immunology, Biophysics and
Structural Biology (esp. Cryo-EM), Metabolomics, Chemical Biology,
Nanotechnology and Tissue Engineering, Proteomics, DNA Damage and Repair,
Genome Editing, and translational research. HCLS is committed to being a
world-class fundamental life science research institution.


QUALIFICATIONS

1. Ph.D. or M.D. degree or equivalent plus postdoctoral experience.
2. Excellent record of research accomplishments.
3. Great potential in biomedical fields.


PACKAGE

1. Internationally competitive salary: 500,000-750,000 CNY (tax included).
2. Sufficient start-up fund: 10,000,000 CNY (2,000,000 CNY per annum for
laboratory personnel, equipment, consumables, and reagents as needed. No
need to apply for additional funds during the appointment).
3. Better housing security: A temporary furnished apartment (50-70 m²) and
settlement allowance (Level A: 1,100,000 CNY, Level B: 600,000 CNY) will be
provided.
4. Spacious lab and office space: 150 m² or larger.
5. Guaranteed graduate student quota: Each PI will be allowed to accept up
to 1-2 Ph.D. students and 2-3 M.S. students. Postdoctoral and technical
personnel can be recruited independently.
6. Advanced education resources: HCLS will assist with arranging daycare
or school for children and a job for the spouse.
7. Solid medical security services: The Harbin Institute of Technology
Hospital and two hospitals affiliated with the university, Heilongjiang
Provincial Hospital and Harbin First Hospital, are available.

Note: For candidates with exceptional accomplishments, the package is
negotiable.


EVALUATION

The initial appointment for PIs will be 5 years with full funding support
from HIT. A permanent position will be obtained upon passing reviews by an
international committee.


HOW TO APPLY

Applications in English as PDF files, including a cover letter, a full CV,
a publication list, three referees, and a statement of future research
plans, should be sent to Prof. Zhiwei Huang (huangzhiwei@hit.edu.cn), with
“PI application” indicated in the email subject.

The application will remain open until filled.


Inner Mongolia University (IMU) is located in Hohhot, the capital of the
Inner Mongolia Autonomous Region and a famous historic and cultural city
about 400 kilometers from Beijing. As the first comprehensive university
established in an ethnic minority region after the founding of the
People’s Republic of China, IMU was established in one step from a high
starting point in 1957, coinciding with the 10th anniversary of the
founding of the Autonomous Region. Ulanhu, then vice-premier of the State
Council and governor of Inner Mongolia, served as the first president of
IMU, and its early high-level faculty consisted of discipline leaders and
key academics selected by the CPC Central Committee and the central
government from a dozen or so prestigious universities across China,
including Peking University. They brought with them long-accumulated
academic traditions and the lofty spirit of their universities, laying a
solid foundation for running the university and introducing a healthy
academic and campus atmosphere to IMU. IMU was approved to enroll graduate
students in 1962, designated as a key university in China in 1978,
authorized to confer doctoral degrees in 1984, included in Project 211 for
the building of key universities in 1997, listed as a university
co-sponsored by Inner Mongolia and the Ministry of Education in 2004,
incorporated into the state program for building one high-level university
in each of the provinces in the Mid-western Region in 2012, selected as one
of the universities in China for the Double First-Class initiative in 2017,
and appointed in 2018 as a university co-sponsored by the Ministry of
Education and Inner Mongolia, i.e., a university sponsored by the Ministry
and included in the ranks of universities under the administration of the
Ministry.


RECRUITMENT

To speed up progress under the Double First-class Initiative, IMU has
implemented the Steed Plan to enhance the introduction and cultivation of
high-end academic talents and the Outstanding Talents Introduction Plan to
introduce talents from home and abroad in large numbers. Over the next five
years, IMU is ready to offer more than 400 vacancies for academic leaders,
key academics, and outstanding talents, to provide a first-class academic
environment, resources, services, and benefits for them, and to build with
concerted efforts an ideal platform for career development, innovation, and
the realization of academic aspirations.


Tel: 0471-4992415 / 4995765
Email: ndszk@imu.edu.cn
Website: www.imu.edu.cn


Founded in 1958, Jiangxi University of Science and Technology (JXUST) was
once attached to the Ministry of Metallurgical Industry of China and the
China National Nonferrous Metals Industry Corporation. In 2013, it became
jointly sponsored by the Ministry of Education, the Ministry of Industry
and Information Technology, and the Jiangxi Provincial Government. JXUST
has been authorized by the State Council of the PRC to grant doctoral,
master's, and bachelor's degrees. The School of Information Engineering at
JXUST was established in April 2001. JXUST is also one of the earliest
universities in Jiangxi Province to conduct research in computer science
and technology. The discipline of applications in computer technology at
JXUST is now a distinctive feature and brand at the Jiangxi Province level.
It was rated as one of the construction projects for National First-class
disciplines in 2009. In the sixty years since its foundation, JXUST has
kept the school motto “Aim for the Highest and Take Responsibility as the
First” at heart, continuously promoted the construction of a high-level
talent team, consistently pursued the strategy of strengthening the
university through talent, and tried its best to achieve intensive
development, characteristic development, and transformational development.
JXUST aims to become an internationally well-known university in science
and technology that has an excellent academic level and encourages
innovation and creativity.


Recruitment disciplines
optical engineering, electronic science and technology, information and
communication engineering, computer science and technology, software
engineering


Forestry University
Special Job Fair
“Metasequoia Scholars”





KEYWORDS 

Forestry, Plant Protection, Horticulture, Grass Science, Forestry 
Engineering, Material Science and Engineering, Chemical En- 
gineering and Technology, Biochemical Engineering, Applied 
Chemistry, Light Industry Technology and Engineering, Land- 
scape Architecture, Garden Plants and Ornamental Horticulture, 
Mechanical Engineering , Control Science and Engineering, 
Civil Engineering, Transportation Engineering, Biology, Ecolo- 
gy, Environmental Science and Engineering, etc. 


REQUIREMENTS
1. Graduated from a domestic “double first-class” university or
a famous overseas university.
2. Two years of postdoctoral or work experience overseas.


HOW TO APPLY


CONTACT US 


Wechat: 13810231645 





2020 "Zhihui Chengyang" High-level Talent Forum & Qingdao University
International Scholars Laoshan Forum


DISCIPLINES 


History, Economics, Statistics, Law, Marxist Theory, Philosophy, Political Science, Sociology, Pedagogy, Psychology, Physical Education, Language and Literature, Drama and Film and Television, Management, Journalism and Communication, Art, Mathematics, Computer Science, Textile Engineering, Medicine, Physics, Oceanography, Chemistry, Biology, Food Science and Engineering, Mechanical Engineering, etc.


QUALIFICATIONS 


Outstanding scholars who have obtained a doctorate from a well-known university or academic institution at home or abroad, can conduct scientific research independently, and have representative results in their research field as well as development potential.


CONTACT US 


Tel (wechat): 13810231645 
For more info: Scan to apply

Nanjing University of Information Science & Technology (NUIST), founded in 1960 and renamed from Nanjing Institute of Meteorology in 2004, was designated one of the key institutions of higher education in China in 1978. It enjoys a reputation as the cradle of meteorological talent and has worldwide influence in the field of geoscience. In 2017, the university was listed among China’s national Double First-class universities and disciplines and was also included among the provincial high-level universities of Jiangsu Province. NUIST is co-constructed by the Jiangsu Provincial People’s Government, the Ministry of Education of China (MOE), the China Meteorological Administration (CMA), the State Oceanic Administration (SOA) and the University of Chinese Academy of Sciences. NUIST is located in Nanjing Jiangbei New Area (a national-level new area).

After nearly 60 years of development, NUIST has established a comprehensive and coordinated discipline system of science, engineering, management, art and humanities, with atmospheric science as its core and information engineering as its basis. In recent years, NUIST has risen rapidly in the four main international university rankings. In 2020, NUIST is ranked 40-58, 39 and 61 in the US News, ARWU and QS rankings respectively. At present, NUIST has 6 doctoral programs, 22 master programs, 14 professional master programs and 71 bachelor programs. The discipline of Atmospheric Science ranks No. 1 in China. The four disciplines of Geoscience, Engineering, Computer Science, and Environmental Science and Ecology rank in the ESI top 1%.


RECRUITMENT TYPES 

(1) Type A 

Academicians of the Chinese Academy of Sciences or Chinese Academy of Engineering; academicians of the National Academy of Sciences/Engineering of developed countries such as the USA, UK, Canada, and Australia; laureates of world-renowned awards such as the Nobel Prize, Carl-Gustaf Rossby Research Medal, Turing Award, and Fields Medal; outstanding international scholars working in top universities or research institutions.

(2) Type B 

Distinguished Professors of MOE; winners of the National Science Fund for Distinguished Young Scholars; National Distinguished Experts; chiefs of NSFC Innovative Research Groups; the first-listed person of a 2nd Prize project of the National Natural Science Awards, a 1st Prize project of the State Technology Invention Awards, or a 1st Prize project of the National Sci-tech Progress Awards; National Teaching Masters; national key discipline leaders; members of the Chinese Academy of Social Sciences; tenured professors from renowned overseas universities. Applicants should be under the age of 50.

(3) Type C

National Young Scholars, winners of the National Science Fund for Outstanding Young Scholars, Young Scholars of MOE, Youth Top-notch Talent of the Organization Department of the CPC Central Committee. Applicants should be under the age of 40.

(4) Type D (candidates for Type A, B and C)

Applicants who are not eligible for Type A/B/C but have the potential to apply successfully for the four types of Youth Introduction Programs or other national talent introduction programs may be employed as backup talents.


SALARIES AND BENEFITS FOR HIGH-LEVEL TALENTS (UNIT: RMB)

Type | Annual salary | Settling-in allowance and housing subsidies | Start-up fund
Type A | 1.5 million or more | 5-20 million settling-in allowance; transitional housing or rental subsidy |
Type B | 1 million or more | No less than 2 million settling-in allowance; transitional housing or rental subsidy | 1-10 million (0.5 million or more for humanities)
Type C | 800,000 or more | Over 1.2 million settling-in allowance; temporary accommodation |
Type D | 300,000-600,000 | Over 0.6 million settling-in allowance; temporary accommodation | 0.5 million or more


Contact information of HR of NUIST:

High-level talents: Tel: 86-25-58731027 (Ms. DING Ni), email: gdrc@nuist.edu.cn

Excellent teachers, Hong Kong, Macao and Taiwan teachers, and professional foreign teachers (non-ethnic-Chinese): Tel: 86-25-58235197 (Ms. HUA Rong), email: zo@nuist.edu.cn


Northeast Agricultural University Welcomes Overseas Talents


Northeast Agricultural University (NEAU) is a national “211 Project” key construction university and a “World-class Discipline” construction university, with agricultural science as its strength, life science and food science as its distinguishing features, and multiple disciplines such as agriculture, industry, science, economics, and management developing in a coordinated way. NEAU is located in Harbin, Heilongjiang Province, China, and was established in 1948. It covers an area of 4.96 teaching faculties, possesses 3 national-level key disciplines and 3 national-level key (nurturing) disciplines, has 4 disciplines in the top 1% of the ESI international discipline rankings and 10 first-class subjects authorized to confer doctoral degrees, and has 10 postdoctoral research mobile stations and 3 postdoctoral research stations.

In recent years, NEAU has attached great importance to the introduction and education of talent, vigorously implemented the strategy of "strengthening NEAU construction by emphasizing talents", and provided good development platforms and conditions for outstanding talents. We now sincerely invite leading talents in certain disciplines, as well as other outstanding talents from overseas, to join us.




I. Recruitment disciplines
Animal Husbandry, Veterinary Science, Food Science and Engineering, Agricultural Engineering, Biology, Crop Science, Horticulture, Agricultural Resources and Environment, Agricultural and Forestry Economic Management, Grass Science, Chemical Engineering and Technology, Water Conservancy Engineering, Ecology, Plant Protection, Landscape Architecture, Environmental Science and Engineering, etc.


II. Requirements

1. Support development of the People's Republic of China, abide by teachers' professional ethics, have the spirit of hard work and dedication, and ensure full-time work in NEAU.

2. Have more than 2 years of overseas study or research experience and have obtained a doctorate degree; professors and equivalents from famous overseas universities, research institutes or the research and development institutions of well-known enterprises will be preferred.


III. Remuneration

1. The recruited talents will be paid RMB 300,000-1 million per year, determined through consultation according to their work performance or with reference to their work situation abroad; RMB 300,000-1 million as a settling-in allowance; and RMB 500,000-10 million as start-up funds to support scientific research. They will also be offered a turnover apartment. High-level talents will receive assistance in arranging work for their spouse and their children’s admission to both pre-school and school.

2. The recruited talents will enjoy a set of supporting facilities for scientific research, independently determine the research direction, organize research teams, and use research funds within the specified scope of responsibility; high-level personnel will be equipped with 1-2 scientific research assistants.

3. The recruited talents will be given related certification of their professional and technical posts.

4. Other issues not mentioned here can be negotiated further.

5. High-level leading talents can negotiate the above conditions one by one in detail.


IV. Application methods

We invite qualified talents for interviews to discuss related matters.

Email: rsc@neau.edu.cn

Address: HR Department, Northeast Agricultural University, 600 Changjiang Road, Xiangfang District, Harbin City, Heilongjiang Province.

Postal code: 150030

NEAU: www.neau.edu.cn

Contact persons: Mrs. Huang and Mr. Guan

Tel: (+86)451-55191982 / (+86)451-55190332

QQ: 83169446 / 254508279





Heilongjiang University of Science and Technology 
Is Recruiting Academic Faculty! 


Heilongjiang University of Science and Technology is the only university in the Heilongjiang, Jilin, and Inner Mongolia area that is characterized by mining engineering. Jointly sponsored by the Heilongjiang Province People's Government and the Ministry of Emergency Management (State Administration of Work Safety), it was rated an “outstanding university” in the higher education evaluation of the Ministry of Education. It is the sole university with a doctoral-level talent program responding to the national special requirement “emergency rescue and influence control for coal mine accidents,” and it hosts a postdoctoral research center for the first-class discipline of safety science and engineering.


REQUIREMENTS 

1. Candidates should hold the nationality of the People's Republic of China, abide by discipline and law, have good ideological and political morality, have strong dedication and a sense of responsibility, and be able to devote themselves to teaching and research work.

2. For doctoral candidates, the first degree should be a full-time unified enrollment bachelor's degree from a public university (excluding top-up degrees), and the master's degree and doctor's degree should be full-time unified enrollment postgraduate degrees. Graduates from schools in Hong Kong, Macao, Taiwan and foreign countries should have a certificate of academic degree issued by the Chinese Service Center for Scholarly Exchange of the Ministry of Education.


RECRUITMENT PLAN
HLJUST plans to recruit 102 doctors in 2020. Recruiting categories include: Mineral Engineering, Surveying and Mapping, Geological Resources and Geological Engineering, Chemical Engineering and Technology, Electrical Engineering, Control Science and Engineering, Mechanical Engineering, Information and Communication Engineering, Electronics Science and Technology, Materials Science and Engineering, Architecture, Civil Engineering, Computer Science and Technology, Science of Business Administration, Science of Public Management, Applied Economics, Sociology, Marxist Theory, Foreign Languages and Literature, Mathematics, Physics and Mechanics.


REMUNERATION AND BENEFITS

The employed candidates will be managed as public institution staff of Heilongjiang Province, and shall enjoy the corresponding remuneration as public institution staff and school faculty.

1. Leading talents (full-time)
(1) Settling-in allowance: 2,000,000 yuan RMB;
(2) Housing subsidy: 1,000,000 yuan RMB;
(3) Cost-of-living allowance: 10,000 yuan RMB per month;
(4) Start-up fund and annual salary: negotiable according to actual demand;
(5) Families: can be settled according to actual demand.

2. Outstanding talents (full-time)
(1) Settling-in allowance: 1,000,000 yuan RMB;
(2) Housing subsidy: 500,000 yuan RMB;
(3) Cost-of-living allowance: 5,000 yuan RMB per month for 5 consecutive years;
(4) Start-up fund and annual salary: negotiable according to actual demand;
(5) Families: can be settled according to actual demand.

3. Topnotch talents (full-time)
(1) Settling-in allowance: 500,000 yuan RMB;
(2) Housing subsidy: 300,000 yuan RMB;
(3) Cost-of-living allowance: 3,000 yuan RMB per month for 5 consecutive years;
(4) Start-up fund and annual salary: negotiable according to actual demand;
(5) Families: can be settled according to actual demand.

4. Provincial high-level talents (full-time)
(1) Settling-in allowance: 200,000 yuan RMB;
(2) Housing subsidy: 100,000 yuan RMB;
(3) Cost-of-living allowance: 2,000 yuan RMB per month for 5 consecutive years;
(4) Start-up fund and annual salary: negotiable according to actual demand;
(5) Families: can be settled according to actual demand.

5. Excellent Doctor (full-time)
(1) Settling-in allowance: 50,000-200,000 yuan RMB;
(2) Start-up fund (Science & Engineering): 30,000-100,000 yuan RMB;
(3) Start-up fund (Humanities & Social Science): 15,000-50,000 yuan RMB;
(5) Doctor's allowance: 600 yuan RMB per month.

HLJUST will give priority to those whose spouse needs to be settled in. The spouse will be recruited through open recruitment if he or she meets the requirements. This is negotiable according to actual demands.


HOW TO APPLY 
E-mail address for application: 
hljkjdxrsc@163.com 


CONTACT 


Mr. Wang, Mr. Han 

Tel: +86(451) 88036082 

Official Website: www.usth.edu.cn 
Address: No. 2468, Puyuan Road, Songbei District, Harbin 150022

Multidisciplinary positions in Wenzhou Medical University


Wenzhou Medical University (WMU) is an institution of higher education co-developed by the Ministry of Education, the National Health and Family Planning Commission and the Zhejiang Provincial Government, and a Priority Development University in Zhejiang Province. WMU offers 4 PhD Programs and 1 Professional PhD Program in Clinical Medicine, Pharmacy, Biomedical Engineering and Basic Medicine, 12 Master Programs, 8 Professional Master’s Programs, and 27 undergraduate programs in Medicine, Science, Engineering and Management. WMU has postdoctoral research centers under the Primary Category Discipline of Clinical Medicine and Pharmaceutical Science. WMU ranks 74th in the overall ESI ranking nationally. Seven programs, including Clinical Medicine, Pharmacology and Toxicology, Biology and Biochemistry, Molecular Biology and Genetics, Materials Science, Chemistry, and Neuroscience and Behavioristics, rank within the global top 1% in ESI rankings, of which the Clinical Medicine Program ranks within the global top 0.2%. At present, the total enrollment of WMU amounts to 13,328, of which 890 are full-time international students. Graduates of the Clinical Medicine Program have consistently ranked within the top 10% nationwide in terms of their pass rate on the national qualification examination for medical practitioners.
WMU has always tapped its great potential for basic research and applied basic research to push itself to the forefront of international scientific research and to meet the needs of local development. WMU has high-level scientific research platforms and abundant clinical resources: 1 State Key Laboratory jointly built by Zhejiang Province and the Ministry of Health, 1 National Engineering Research Center, 1 National Clinical Research Center (for Ophthalmic Diseases), 2 China International Science and Technology Cooperation Bases, and 1 National Center for International Research. It has 5 directly affiliated hospitals with more than 10 million outpatient and emergency patient visits per year, 3 of which are among the top 100 hospitals selected by the Chinese Academy of Medicine in 2019 based on their technology impact. WMU also has 25 non-directly affiliated hospitals (clinical colleges), which provide a solid foundation for medical teaching and research. For more information, please visit http://www.wmu.edu.cn/index.htm.


All-year-round recruitment

WMU is currently recruiting multidisciplinary faculty at several levels: discipline leading talents, discipline leaders, academic leaders, academic backbones and excellent doctoral and post-doctoral fellows. WMU provides a competitive salary, rewards for scientific research, scientific research start-up funds, a settling-in allowance, housing subsidies and medical insurance, etc. For more details of the position descriptions, please visit http://rsc.wmu.edu.cn/info/1043/2667.htm.


Research disciplines

(including but not limited to)
Medical sciences, life science related disciplines, emerging interdisciplinary fundamental research, frontier research and translational research, etc.

1. Medical Sciences: clinical medicine, pharmaceutical science, basic medicine, public health and preventive medicine and medical technology, etc.

2. Sciences: life science, biology, physics, optics, chemistry and mathematics, etc.

3. Engineering: optical engineering, materials science and engineering, biomedical engineering, electronic science and technology, information and communication engineering, computer science and technology and environmental science and engineering, etc.


Application procedure

Please send your application including CV, research plan, cover letter and related supporting materials to rscrck@wmu.edu.cn. Please name your email after “name + University”. Review of applications will start upon receipt.


CONTACT 

Human Resource Department, Wenzhou Medical University

Tel. and fax: 0577-86689737 

Email: rscrck@wmu.edu.cn 

Official Website: www.wmu.edu.cn 
Address: University Town, Wenzhou, Zhejiang, China


Are you an early career scientist looking for your first investigator role? Or mid-career searching for a close-knit institute that provides scientists exceptional support? If so, we want to hear from you!


Van Andel Institute is hiring! We are a global leader in cancer epigenetics and Parkinson's disease research, with complementary strengths in metabolism, structural biology and cell biology.

An innovative hub for exceptional basic research, we engage the world’s brightest minds to pursue bold ideas. Our scientists translate their discoveries through partnerships with academic, clinical, industry and philanthropic collaborators. Students in our research-intensive Graduate School complete their dissertation work in one of our 32 faculty-led labs.


Six assistant professor positions in the departments of Epigenetics, Neurodegenerative Science, Metabolism and Nutritional Programming, Structural Biology and Cell Biology. If your research is in or related to one of these fields, we welcome your application. We also will consider strong applicants at the level of associate or full professor.


What we offer 


A robust research environment: 32 labs, eight state-of-the-art cores, an accredited Graduate School and more than 400 dedicated scientists, educators and support staff.


Faculty support: 100% VAI-paid salary plus $65,000 annually in lab funds, a comprehensive start-up package, lab and home relocation services, administrative support, salaries for 1.5 graduate students, subsidized high-performance computing and Core Technologies and Services (including 60 hours/year of paid bioinformatics/biostatistics time), and internal funding programs.


Comprehensive benefits: Health, vision, dental, life and long-term 
care insurance, six weeks of parental leave, 401(k) with match 
and more. 


One-of-a-kind operations and administrative support: 
Infrastructure designed to support science and give our 
investigators more bandwidth for research, and expert support 
teams including research operations, clinical collaboration 
management, high-performance computing, lab supply center, 
scientific events, grants/budget support, communications and 
marketing, philanthropy, on-site library services, glassware, 
business development and technology transfer. 


A “Top 10” community: Located in the heart of the Medical Mile in 
downtown Grand Rapids, Michigan — a city consistently included 
on “Top 10” lists for raising a family, growth and much more. 


Learn more and apply at 
vai.org/futurefaculty 


reveals the top companies in biotech & pharma voted on by Science
Read the article and employer profiles at sciencecareers.org/topemployers
Science 2020 Top Employer


CSIR-CENTRAL DRUG RESEARCH INSTITUTE
(Council of Scientific & Industrial Research)
Sector 10, Jankipuram Extension Sitapur Road, Lucknow-226031, India

Career Opportunities for Scientist
SCIENTIST RECRUITMENT-2020
Advertisement No 10/2020
A unique opportunity for a research career in Science & Technology in the area of Drugs, Pharmaceuticals and Biomedical Research


CSIR-Central Drug Research Institute, Lucknow, a premier Institute 
under Council of Scientific and Industrial Research (CSIR), is involved in 
multidisciplinary R&D programs of both basic and applied nature. 

With the aim to expand and strengthen the areas of Neurobiology, 
Virology, Parasite Biology, Endocrinology, Human Genetics, Analytical 
Biochemistry, Computational Chemistry, Medicinal Process Chemistry, 
Statistics, IPR & HR Management, the Institute is looking for 
enthusiastic, talented young researchers / professionals with brilliant 
academic record, proven scientific achievements and zeal to conduct 
research as per the mandate of the Institute aligned with National 
Missions. Due weightage will be given to the candidates involved in the 
product development/ technology innovation/applied technology / 
translation research. The positions offer exciting opportunities for career 
growth as per the Flexible Complementing Scheme for Scientists. 

Applications are invited from motivated young Indian Nationals with 
requisite qualification and experience for the posts of Principal Scientist 
(1 post), Senior Scientist (8 posts) and Scientist (8 posts). 


For detailed advertisement and submission of application, please 
visit https://recruit.cdri.res.in or access the link 'SCIENTIST 
RECRUITMENT-2020' on CSIR-CDRI Website https://cdri.res.in 








Start date for registration and online application: 05/12/2020 
Last date for registration and submission of application: 15/01/2021 


Last date of receipt of physical copy of application at CSIR-CDRI: 01/02/2021 








Northeastern University
College of Engineering

With over 185 tenured/tenure-track faculty, 18 multidisciplinary research centers and institutes, and funding by eight federal agencies, Northeastern’s College of Engineering is in a period of dynamic growth. Our emphasis on interdisciplinary, transformative and innovative research, tied to Northeastern’s unique history of industry collaboration through the university's signature cooperative education program, enables partnerships with academic institutions, medical research centers, and companies near our centrally located Boston campus and around the globe.

Learn more and apply at coe.northeastern.edu/faculty-hiring

Consideration will be given to candidates at the assistant, associate, and full professor levels; successful applicants will be expected to lead internationally recognized research programs aligned with one or more of the college’s strategic research initiatives. We are also seeking to recruit and support a broadly diverse community of faculty and staff, and we strive to foster an inclusive culture built on respect that affirms inter-group relations and builds cohesion. Applicants will be asked to submit a diversity statement discussing how they view their contributions to sustainment and improvement of diversity in the college and community at large.

Northeastern University is an equal opportunity employer, seeking to recruit and support a broadly diverse community of faculty and staff. Northeastern values and celebrates diversity in all its forms and strives to foster an inclusive culture built on respect that affirms inter-group relations and builds cohesion.

All qualified applicants are encouraged to apply and will receive consideration for employment without regard to race, religion, color, national origin, age, sex, sexual orientation, disability status, or any other characteristic protected by applicable law.

To learn more about Northeastern University’s commitment and support of diversity and inclusion, please see northeastern.edu/diversity.


Science Careers
FROM THE JOURNAL SCIENCE, AAAS

myIDP: A career plan customized for you, by you.

Features in myIDP include:

• Exercises to help you examine your skills, interests, and values.

• A list of 20 scientific career paths with a prediction of which ones best fit your skills and interests.

Visit the website and start planning today!


Government of India


Ministry of Education
Department of Higher Education
Technical Section - I


APPOINTMENT OF DIRECTOR, IIT INDORE AND IIT MANDI


Applications are invited for appointment to the post of Director of Indian Institute of Technology (IIT) at Indore and Mandi. The Director of an IIT is the academic and administrative head of the Institution. He/she is expected to have a minimum of 5 years’ administrative experience and leadership qualities to head an Institute of National importance. The candidate/person should be a Ph.D. with first class or equivalent at the preceding degree, preferably in a branch of Engineering. In exceptional cases, candidates with Science, Mathematics or Management degrees may be considered. He/she should have an outstanding academic record throughout and a minimum of 10 years’ teaching experience as a Professor in a reputed Engineering or Technology Institute or University and should have guided Ph.D. students. The applicant should preferably be less than 60 years of age on the last date of receipt of the applications. The post carries a fixed pay of Rs. 2,25,000/- (Revised) per month, with allowances as per rules.


2. Interested individuals may apply giving their detailed resume in the prescribed format clearly bringing out research, teaching, industry-academia collaborations and administrative achievements, along with a two-page justification in support of their candidature, a two-page vision statement for the institution and contact details of at least two distinguished individuals well acquainted with their work. The application typed in the prescribed format along with enclosures may be sent by Registered/Speed Post to The Under Secretary (TS.1), Department of Higher Education, Ministry of Education, Room No. 428 “C” Wing, Shastri Bhawan, New Delhi -110 001 so as to reach the Ministry on or before 31st December, 2020.


The detailed advertisement and the format of application are available on the website (www.education.gov.in).


WORKING LIFE

By Chee Kiang Ewe

Dare to be different

Sitting in the giant lecture hall on my first day of university, I felt terribly out of place. As the first and only member of my family to attend college, leaving my home in rural Malaysia to study in New Zealand was exciting, and terrifying. I wished I was as confident and articulate as my classmates. In the years that followed, I coped by assimilating, relinquishing parts of my old self and remodeling my identity to blend in. I tuned my accent to sound like my classmates. I lied that my uncle was a doctor. I made friends, but consciously changing myself was mentally and emotionally draining. It took me a long time to see there could be another way.

When I came to the United States for graduate school, I changed my identity again. When my labmates made references to classic American films or books that I had never heard of, I nodded knowingly to hide my ignorance. I learned how to make small talk. I began to introduce myself exclusively as Ethan, my adopted American name, instead of my formal “ethnic-sounding” name, Chee Kiang. These acts took effort that I could have spent on my studies, but I didn’t see any other option.

My perspective began to change when I was a teaching assistant (TA) in my second year of grad school. My students’ evaluations of me were filled with negative comments about my accent while saying nothing about my teaching. I was embarrassed, but this time I resolved to handle the situation differently. My former impulse to focus on fitting in largely stemmed from my lack of self-confidence, but I had grown over the years. I had altered my identity twice; I was not going to do it again.

So, instead of changing my accent, I learned to slow the pace of my speech and to reiterate important points. Many of my students became more engaged. They started to show up to my office hours, seek me out for help, and strike up conversations when they saw me outside the classroom. I realized that I did not need to change my accent to be understood. Assimilation was not the only option.

Since then, I have stopped suppressing my identity as a first-generation, international student and instead celebrate it as a strength. Because of the extra miles I have covered, I am resourceful. Because of the additional hurdles I have overcome, I am persistent and resilient.

Throughout my academic career, I have felt I must work extra hard to be seen and heard. This pressure can be taxing. But it also spurs me to seize opportunities and take on responsibilities that help me grow as a scientist and professional. I watch research seminars on YouTube when I have time to fill, host additional office hours as a TA, and volunteer to organize departmental retreats and research symposiums. It isn’t easy, but I have learned so much along the way.

I want underrepresented students like me to have a fighting chance in the academic world, and now I have the confidence to help make that happen. During the summer, U.S. Immigration and Customs Enforcement threatened to expel some international students amid the COVID-19 pandemic. Like many other international students and scholars in the United States, I felt unwanted and betrayed. So I took action. I sent an email to my department chair suggesting that this year’s retreat highlight the importance of diversity in race, ethnicity, gender, sexuality, nationality, socioeconomic status, and more. Much to my delight, my proposal was accepted. Over the 2-day meeting, students and faculty members had fruitful conversations about inequities in academia and celebrated the beautiful science conducted by diverse researchers from within and outside our campus.

I am still stumbling through the mysterious, labyrinthine world of academia, but now with more confidence than the insecure 20-year-old who thought he did not belong in that lecture hall. I am not the same as all of my peers, and that’s OK.

Chee Kiang Ewe is a Ph.D. candidate at the University of California, Santa Barbara. Send your career story to SciCareerEditor@aaas.org.